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Incidental benefits 


Scientists who screen the genes of volunteers for research should tell participants if they 


find information relevant to their health. 


volunteer's health — at initial screening, during the study itself, 
or even after the study finishes, when other researchers review 
the data or conduct their own analyses. 

For the most part, researchers have opted not to reveal these poten- 
tially important ‘incidental findings’ to participants. This has been 
to protect the research process, and to prevent coercing people into 
studies by unwittingly eliciting the ‘therapeutic misconception’ — the 
incorrect assumption on the individual’s part that participating in a 
study will help their own health. 

But the emergence of high-throughput genomics, with its ability to 
catalogue vast amounts of information that may have a bearing ona 
person’ health, has prompted a rethink of this convention. 

A working group funded by the US National Institutes of Health 
(NIH) in Bethesda, Maryland, has recommended that biobanks and 
archives that house large genetic data sets introduce policies to encour- 
age the return of incidental findings to research subjects (see page 387). 

The impact of these recommendations should not be under- 
estimated. As genetics invades every branch of medicine, no fieldis likely 
to be exempt from ethical standards introduced to cover genetic data. 
The recommendations are likely to have their most immediate influ- 
ence in discussions on the topic now under way at the US Department 
of Health and the NIH. 

There is a precedent for returning information to the subjects of 
a study. In imaging studies, for instance, a radiologist often reviews 
patients’ scans for incidental findings before they are analysed by 
researchers. Support is growing for the idea that genetics researchers 
should similarly review a selected set of genes with known impacts on 
health before undertaking their own research. 

Implementing this will not be easy. Defining appropriate sets of genes 
is problematic, and any list will need to be constantly updated. And to 
return the information to an individual in a way that avoids unneces- 
sary anxiety and medical expense is a huge issue. Opponents of the idea 
point out that it contravenes standard practice in medicine itself, because 
doctors do not routinely sift through patients records for old test results 
that may carry new significance in the light of more recent research. 

Perhaps the most visible example of the need for this debate comes 
not from science, but from commerce. Companies routinely mine vast 
quantities of consumer information to influence marketing decisions. 
Governments have not been able to keep pace with standard business 
practices, and most consumers are unaware of the breadth and depth 
of information that companies gather on them from Internet searches, 
social networks and supermarket purchases. 

In testimony before the US Presidential Commission on Bioethics in 
February, John Wilbanks, who runs the project Consent to Research, 
noted that this pattern is likely to repeat itself in the era of electronic 
medical records and genomics. “My sad realization is that whether it’s 
your genome or your health information, anyone who really wants to 


A ll research studies on humans can uncover facts relevant to a 


screw you will probably be able to get a copy of the data they need to 
do so, and the people who are least likely to get a copy are the peo- 
ple who can do something amazing with it, like researchers,’ he said. 
Companies are lining up to market products to consumers on the basis 

of their genomes. Law-enforcement agencies 


“No fie Idis like ly already use DNA left at crime scenes to find 
to be exempt suspects and their relatives, and are funding 
from ethical programmes to create physical profiles of 
standards suspects on the basis of their DNA. In other 
introduced to words, people now have incomplete pro- 
cover genetic tection for their own DNA, and this lack of 
data.” privacy is likely to increase in the future. 


In this free market, how sure can research- 
ers be that they are truly doing no harm to their study participants 
when they take a cheek swab? People thinking of entering a study will 
assess the risks of how their volunteered genetic information might 
be used, and this might make them more reluctant to participate. 
Researchers could help to counter this by offering them medically 
relevant information back in exchange. m= 


Flight risk 


As the campaign against animal research 
intensifies, so must the response. 


photographs of stroke victims and sufferers of Parkinson's 

disease. They are demonstrating outside the corporate head- 
quarters of British Airways, Lufthansa and Delta, demanding that the 
airlines stop impeding the biomedical research that could deliver big 
advances against these and other diseases. 

Seem far-fetched? Maybe. But if scientists want continued access 
to animals as research models, they will have to appear on the front 
line with every bit as much visibility, determination, organization and 
persistence as animal-rights activists now muster. 

In arenewed campaign targeting transportation companies, pro- 
testors have found a public pressure point so effective that only a few 
major airlines still agree to transport non-human primates bound for 
research labs (see page 381). Nor is the focus confined to primate 
transportation: earlier this year, the last ferry company that was willing 
to carry research rodents into the United Kingdom stopped doing so. 
Such blocks, scientists warn, could shift much animal work to coun- 
tries where regulations are more lax. 

But there is a silent majority for whom the activists do not speak. 


Priv a crowd of scientists waving placards plastered with 
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This includes most scientists, many ordinary citizens and millions of 
patients and their families touched by disease and injury. This con- 
stituency must mobilize in the defence of human health if the gains 
enabled by future primate research — and, ultimately, all animal 
research — are not to be thrown away. 

Scientists and their allies must, of course, continue to be open about 
the price animals pay in research. They must openly acknowledge, 
immediately correct and do everything they can to prevent lapses in 
the care of the animals in their charge. At the same time, scientists 
must make every effort to use lower animal, and non-animal, models 
where possible, as regulations already require. And alongside all of 
that, they must emphasize the tangible and compelling improvements 
to human life that animal research has made possible. 

Consider stroke, which affects some 795,000 people in the United 
States alone each year — 1 person every 40 seconds — at a cost in 
excess of US$40 billion. More than 1,000 experimental treatments 
aimed at protecting brains cells in acute stroke have been developed 
in cells and rodents; none has been effective in humans. So a possi- 
ble advance in a paper published last month is significant (D. J. Cook 
et al. Nature 483, 213-217; 2012). Using macaques — animals whose 
neuroanatomy, genetics and behaviour are far closer to humans than 
are those of rodents — the study showed that a drug called a PSD-95 
inhibitor reduced the volume of brain tissue killed by the stroke and 
significantly preserved neurological function. It has now moved into 
human trials, where early results are promising. 

Many other advances have been made possible by primate research. 
Trials of brain-machine interfaces that allow quadriplegics to control 
robotic limbs with their minds, for example, and of gene therapy for 
haemophilia B. And deep-brain stimulation to alleviate the symptoms 


of advanced Parkinson's disease. Primates have also helped the devel- 
opment of antiviral therapies to prevent mother-to-child transmission 
of HIV; of Rituxan (rituximab), a key drug against non-Hodgkin's lym- 
phoma; and of Benlysta (belimumab), the first new lupus treatment 
in 50 years. Primates are making possible strides towards a vaccine 
against Ebola, and they have affirmed that the diabetes drug pioglita- 

zone can slow early Parkinson's disease — a 


“Silence from therapy now being taken into human trials. 

the research Future advances of this kind cannot be 
community will allowed to stall. The scientific community 
mean lost access  anditsallies in patient and research advocacy 
to research groups should mount a vigorous, coordinated 


campaign to lobby major airlines. They should 
vocally support companies such as Air France 
that continue to fly research primates. They should confront others with 
the human cost of their alignment with animal activists. And to those 
airlines that are wavering, they should make a pointed, persuasive case. 
An apt target in the last category is United Airlines, which merged 
with Continental in 2010. Continental was the last major US airline 
carrying non-human primates for research. The now-merged airline, 
still under the name of United, says it is “in the process of integrating” 
the former United and Continental policies on transporting research 
primates, for publication this autumn. Only a powerful and visible 
public campaign urging United to transport research primates will 
give the firm the political cover it needs to take such a position. 
Urgent and dramatic action is necessary. It is increasingly clear 
that silence and passivity from the research community will lead to 
only one result: lost access to research animals in the countries best 
equipped for their responsible, humane and justified use. = 


animals.” 


Mass appeal 


As physicists close in on the Higgs boson, they 
should resist calls to change its name. 


— physicists besides Peter Higgs contributed to the theory that 
predicts it. But it is far from the most extreme example. 

The relationship between the velocity of recession of galaxies and 
their distance from Earth that we call Hubble’s law was first formu- 
lated by Belgian cosmologist Georges Lemaitre. The quantity known 
as Avogadros number was first calculated by Austrian chemist Johann 
Josef Loschmidt. Higgs, a physicist at the University of Edinburgh, 
UK, at least has the strongest claim to credit for the boson. And for the 
arcane world of particle physics, simplification is often a good thing. 

In 1964, Higgs was the first to postulate the existence of a massive 
particle arising from the mechanism of electroweak symmetry breaking, 
in which a unification of electromagnetism and the weak nuclear force 
fails in such a way as to give some force-carrying particles masses while 
others remain without. Yet moves are afoot to rename the Higgs. Earlier 
this month, some speakers at the Moriond particle-physics conference 
in La Thuile, Italy, chose to describe progress on the search for the BEH 
scalar boson (after the physicists Robert Brout, Francois Englert and 
Higgs) or the SM scalar boson (where SM stands for standard model). 

Itis not hard to guess why. Experiments at the Large Hadron Collider 
at CERN, Europe's premier particle-physics laboratory near Geneva 
in Switzerland, have reported tentative signals of the Higgs. If these 
are real, data collected in 2012 should see CERN claim a discovery. A 
Nobel prize is in the offing, and one that should arguably go not just 
to the experimentalist, but also to the theorists whose efforts inspired 
the successful search. But who — besides Higgs — should be included? 

The name Higgs boson was supposedly coined by Korean-born 


ik naming of the Higgs boson as such is clearly a simplification 


374 | NATURE | VOL 483 | 22 MARCH 2012 


physicist Ben Lee, some time between 1966 and 1972. According to jour- 
nalist Ian Sample, Lee learned about the mass-giving mechanism from 
Higgs, and later coined the term asa shorthand. Yet several others played 
an important part in developing the theory that gives rise to the Higgs. 
Belgian physicists Brout and Englert were the first to publish on 
the subject in 1964, building on ideas from condensed-matter phys- 
ics developed by physics Nobel laureate Phil Anderson and others. 
Higgs published the same year; his paper contains the first mention 
of the boson. Tom Kibble, Gerald Guralnik and Carl Hagen followed 
with a third account that is generally considered more complete. One 
interpretation of this history is reflected in the American Physical 
Society’s joint award, in 2010, of the J. J. Sakurai Prize for Theoretical 
Particle Physics to Brout, Englert, Higgs, Guralnik, Hagen and Kibble. 
This provides plenty for physicists to argue about. In 2010, a row over 
credit erupted when Brout, Englert and Higgs were acknowledged in an 
advertisement for a conference on the particle but Guralnik, Hagen and 
Kibble were not (see Nature http://doi.org/ctz988; 2010). Meanwhile, 
2011 saw a dispute over editing of the Wikipedia article ‘Higgs bosor’ 
between one editor who supported Guralnik’s view that his paper with 
Hagen and Kibble proposed a boson and another who was pro-Higgs. 
Particle physicists should not rename the Higgs. And the reason 
would be obvious to anyone in business: branding. There are already 
relatively few concepts in their subject that have achieved widespread 
recognition without crossing one of them out. In business, it would be 
considered destructive to take a well-known name and replace it with a 
long-winded, technical-sounding alternative that no one has heard of. 
Correct allocation of credit is important, and authoritative accounts of 
the history of science are useful and enlightening, but both must be bal- 
anced with science’ need for consistent conventions, brevity, and public 
communication and outreach, especially when taxpayer's expenditures, 
such as the US$6.5 billion for the Large Hadron 
Collider, are at stake. Renaming the Higgs boson 
in the year when it is most likely to be found gets 
the balance wrong. (And anyway, Higgs is a better 
name than the God particle, isn’t it?) m 
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— the United Nations Conference on Sustainable Development 
(Rio+20) in Rio de Janeiro, Brazil, in June — officials and politi- 
cians are calling for further assessments of our global ecological plight. 
In January, for example, a panel on global sustainability set up by 
UN secretary-general Ban Ki-moon recommended a “periodic global 
sustainable development outlook report that brings together infor- 
mation and assessments currently dispersed across institutions and 
analyses them in an integrated way”. 

This is a response to research that shows how global society is 
increasingly interconnected and interdependent. The cascading effect 
on land availability and food security of a switch to biofuels, for exam- 
ple, demonstrates how actions to address carbon dioxide emissions 
can rebound on other goals. 

But, in these difficult times, can the thinly 
stretched scientific community support a new 
assessment process? And is that really what 
policy-makers need from research? 

Scientists are already busy on policy-makers’ 
behalf. There is the Intergovernmental Panel 
on Climate Change (IPCC), the Millennium 
Ecosystem Assessment, assessments of inter- 
national waters, mountains and fresh water, the 
Global Marine Assessment and the important 
new Intergovernmental Platform on Biodiver- 
sity and Ecosystem Services. Each has a crucial 
role in consolidating knowledge about individual 
sectors. But how to connect the dots? 

In many areas, the rates of global environmen- 
tal change are accelerating but decision-making 
processes are stuck in low gear. It is not clear 
that another conventional assessment will catalyse swifter action. So, 
although the research community should rally behind an integrated 
analysis, it must be done differently. 

First, the focus must shift from documenting problems to support- 
ing solutions. This requires strong and continual interaction between 
those working in strategic applied research and decision-makers in 
policy, industry and civil society, both on specific decisions (such as 
how to frame a particular trade agreement) and on the wider context 
(interactions between national well-being, environmental outcomes 
and economic flows). 

Second, the process must promote responses at all scales, from 
national governments and regional groups to UN institutions. Appro- 
priate solutions will differ from region to region, be they specific 
technologies for energy production or carbon 


A s the world heads towards the next big environmental summit 


sequestration, or analyses that jointly address NATURE.COM 

water, energy and food. Discuss this article 
Finally, the process must work across sectors, _ online at: 

through simultaneous analysis ofthe impact ofa __go.lature.com/79fhuu 


THE FOCUS MUST 
SHIFT FROM 


DOCUMENTING 
PROBLEMS 


TO 
SUPPORTING 
SOLUTIONS. 


Change the approach to 
sustainable development 


Conventional environmental assessments are not enough — itis time for some 
joined-up global thinking, says Mark Stafford Smith. 


migration policy on environmental and social well-being, for example. 
To do this comprehensively, the research must also become more inte- 
grated, encompassing natural and social sciences and the humanities 
to understand the implications of changes. 

How could this be done? Two proposals already on the table for 
Rio+20 could help: a UN Sustainable Development Council (UNSDC), 
directly answerable to the UN General Assembly, and a set of sustaina- 
ble-development goals (SDGs). 

AUNSDC could commission strategic analyses of global sustainabil- 
ity and set up and coordinate decision-specific panels — small mixed 
working groups that include non-scientific members, appointed to 
report quickly on specific issues. Such panels would work across sec- 
tors, independent from but jointly owned by global bodies such as the 
Food and Agriculture Organization of the UN, 
the World Trade Organization, the UN Environ- 
ment Programme and the World Bank. 

This model must be replicated at regional and 
national levels, driven by local needs, with local 
ownership. Light-touch coordination by the 
UNSDC would ensure good communication 
and exchange of ideas, and would make sure that 
activities in one region did not lead to perverse 
global outcomes — perhaps by causing people to 
move, distorting prices or over-using resources. 

The SDGs could ensure that these activities 
integrate the three pillars of sustainability — 
environmental, economic and social — instead 
of dealing with each in isolation, as the UN’s cur- 
rent Millennium Development Goals do. They 
should connect sectors, aiming for example to 
improve well-being without environmental dam- 
age, ensure food security without undermining local livelihoods and 
develop habitable urban environments without increasing resource use. 

All of this builds on existing trends in the activities of bodies such as 
the IPCC, but we need a rapid step change in the evolving relationship 
between science and decision-making. 

Countries such as Australia already talk about ‘national innovation 
systems’ — the totality of their pure and applied research efforts and 
the interactions of these with decision-making in industry and govern- 
ment. Itis time to embrace a global innovation system to support better- 
coordinated and more nimble decision-making on global sustainability 
at all scales. Much work needs to be done on the details, but if science is 
to be genuinely useful to society, this is what we must fight for. m 


Mark Stafford Smith is science director of the Commonwealth 
Scientific and Industrial Research Organisation Climate Adaptation 
Flagship in Canberra, and co-chair of the Planet Under Pressure 
conference, which will be held in London next week. 

e-mail: mark.staffordsmith@csiro.au 
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Hydrogen can be 
stored as acid 


Hydrogen gas holds promise 
as a carbon-free fuel, but 

is difficult to store and 
transport because it is highly 
flammable, diffuse and has a 
low energy density. Jonathan 
Hull at Brookhaven National 
Laboratory in New Yorkand 
his colleagues have found a 
possible way to store hydrogen 
as an aqueous solution of 
formic acid (HCOOH), which 
has a higher energy density. 

At ambient temperature and 
pressure, an iridium catalyst 
dissolved in water causes H, 
to react with carbon dioxide 
to form the acid under mildly BIOENGINEERING 
basic conditions. Acidifying 
the solution triggers the release A | ti t il r 
of pure pressurized H, gas. pu Sa | i 4 Sul 0 | a C | p 


Because of the mild conditions 


needed for the reactions, the A coin-sized device created by a team at simulated gut contractions, or peristalsis, by 
work could eventually lead to a Harvard University mimics the structure applying suction through two side chambers. 
mechanism for H, storage. and physiology of the human intestine by In response, the epithelial cells formed folds 
Nature Chem. http://dx.doi. supporting gut microbes and imitating the similar to the finger-like protrusions, or villi, 
org/10.1038/nchem.1295 (2012) organ’s rhythmic motion. that line the inner intestinal wall. 
Donald Ingber and his colleagues at the The gut tissue layer blocked the flow of small 
Wyss Institute in Boston, Massachusetts, built molecules between the channels, and this 
a ape the chip (pictured) out of a clear polymer. barrier function improved with the presence of 
Slici ng si licon It contains two microscopic fluid channels the bacteria. The authors say that their device 
with less waste separated by a porous, flexible membrane. is a better intestinal mimic than cells in static 
Human gut epithelial cells, which line the gut’s culture and suggest that it could be used for 
Cutting silicon blocks into surface, cover the membrane and supported drug screening and toxicity tests. 
thin wafers for solar cells and the growth of acommon gut bacterium, Lab Chip http://dx.doi.org/10.1039/C2LC40074J 
other applications generates Lactobacillus rhamnosus. The researchers (2012) 


a lot of wasted metal because 


of the width and vibration of chemicals to dissolve those ZOOLOGY 


the mechanical sawing wire parts of the silicon away. 

typically used. Sungho Jin at Strong neodymium magnets All the better to 
the University of California, guide the chemical etching, see whales with 
San Diego, and his colleagues allowing the researchers to 

report a way to etch silicon slice silicon in any desired The giant eyes of the world’s 
into various intricate shapes direction, forming sheets, largest squid seem to be 

that produces an order of microneedles, nanowires specially adapted to spot 
magnitude less waste than (pictured) and tunnels. approaching predatory sperm 
conventional approaches. Magnetically guided chemical whales. 

The authors adapted a etching makes thinner cuts Dan-Eric Nilsson at Lund 
common etching technique, than mechanical sawing, thus University in Sweden and 
depositing a catalytic gold reducing waste. his colleagues examined a 
and iron layer on top of Nano Lett. http://dx.doi. photograph of an adult giant 
certain sections ofa silicon org/10.1021/n1300141k squid (Architeuthis sp.) and 
wafer and using other (2012) a full-size colossal squid 
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(Mesonychoteuthis hamiltoni) 
specimen, and found both 
animals’ eyes to measure about 
27 centimetres in diameter — 
the largest eyes of any species. 
The researchers created a 
mathematical model relating 
eye size to vision. They found 
that the only advantage of 
such large eyes over smaller 
ones is an enhanced ability to 
detect large, luminous objects 
below ocean depths of about 
500 metres. 

The authors suggest that 
these squid evolved such eyes 
to detect the bioluminescence 
that can be created by the 
movement of hunting whales. 
Curr. Biol. http://dx.doi. 
org/10.1016/j.cub.2012.02.031 
(2012) 


Bacteria signal 
to survive 


Bacterial cells that are 
genetically identical to 
members of their population 
that succumb to antibiotics 
can survive in a dormant 
state, thanks to chemical 
communication between 
bacteria. 

Surviving ‘persister’ cells 
have been implicated in 
chronic infections such as 
tuberculosis. James Collins 
and his 
team at 
Boston 
University in 
Massachusetts gave 
Escherichia coli the 
chemical indole, which 
the bacteria produce as a 
signalling molecule. Indole- 
treated E. coli were able 
to withstand higher levels 
of several antibiotics than 
untreated bacteria. Moreover, 
the individual bacterial cells 
in aculture that were most 
responsive to indole were 
also those most resistant to 
antibiotics. 

Indole activates genes 
involved in responding to 
stress, and E. coli strains 
lacking stress-response genes 
produce fewer persisters. 
Nature Chem. Biol. http://dx.doi. 
org/10.1038/nchembio.915 
(2012) 


Predicting 
realistic rains 


A high-resolution climate 
model has produced the most 
accurate representation yet 
of rainfall in the southern 
United Kingdom, bolstering 
confidence in the potential of 
detailed regional modelling. 

A team led by Elizabeth 
Kendon at the Met Office 
Hadley Centre in Exeter, UK, 
used a regional model with a 
1.5-kilometre grid to conduct 
detailed simulations from 
1989 to 2008. The authors 
analysed the duration, extent 
and intensity of rainfall and 
compared the results of their 
model with those produced by 
a less-detailed 12-kilometre 
regional climate model. 

The results suggest that 
the 1.5-kilometre model 
significantly improves the 
representation of heavy rains 
and reduces long-standing 
inaccuracies that produced 
too much light rain. 
J. Clim. http://dx.doi.org/ 
10.1175/JCLI-d-11-00562.1 
(2012) 


EVOLUTION 


No sweetness for 
meat-eaters 


Many meat-eating animals 
have lost their ability to taste 
sugars, having lost a working 
copy ofa gene that encodes a 
taste receptor for sugar. 
Peihua Jiang and Gary 
Beauchamp at the Monell 
Chemical Senses Center in 
Philadelphia, Pennsylvania, 
and their co-workers 
sequenced DNA from 
12 members of the order 
Carnivora, including spotted 
hyenas and several sea 
mammals. Seven of the species 
carried a malfunctioning 
copy of the Tas1r2 gene that 
encodes a sweet taste receptor. 
However, the exact mutations 
differed from one species 
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Testosterone hinders collaboration 


> HIGHLY READ 
on rspb.royalso 
cietypublishing 
org in February 


Women taking testosterone pills 
collaborate less effectively than those given 
a placebo. 

Nicholas Wright and his colleagues at 


University College London asked 17 pairs 
of women to decide individually if a set of striped circles on 
one screen was brighter than those ona second screen. If the 
women within a pair disagreed, they had to collaborate to 


come up with a joint decision. 


The team found that the women were generally more 
accurate when they collaborated than when working 
alone. However, testosterone decreased this boost in the 
women’s performance compared with placebo, even though 
the hormone did not affect the accuracy of the decisions 
made individually. The authors suggest that the hormone 
causes individuals to place much more weight on their own 


judgment than on that of others. 


Proc. R. Soc. B http://dx.doi.org/10.1098/rspb.2011.2523 (2012) 


to another, suggesting that 
carnivores have independently 
lost their ability to detect 
sugars during the course of 
evolution. 

Furthermore, bottlenose 
dolphins (pictured) and sea 
lions lack working 
copies of the gene 
encoding receptors 
for tasting savoury 

flavours, or umami. 
Dolphins also seem 
to have lost a receptor that 
senses bitter compounds. The 
authors say taste may not affect 
what these creatures eat as 
sea lions and dolphins tend to 
swallow their food whole. 
Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1118360109 (2012) 
For alonger story on this 
research, see go.nature.com/ 
y4tzk7 


Coming surge in 
storm surges 


As the climate warms and sea 
levels rise, the frequency of 
local extremes in storm surges 
will increase along much of the 
US coastline. 

To assess changes in local 


flood risk, Claudia Tebaldi at 
Climate Central in Princeton, 
New Jersey, and her colleagues 
combined projections from 

a model of global sea-level 

rise with long-term records 
from 55 tidal gauges around 
the United States. The team 
estimates that by 2050, one- 
third of gauge locations will see 
an increase in the frequency of 
extreme high-water levels that 
are currently expected to occur 
only about once a century. 
Some locations can expect to 
see these extremes, on average, 
every ten years, others even 
annually. 

Ina separate study, 
Benjamin Strauss, also at 
Climate Central, and his 
colleagues assessed US 
communities’ topographic 
vulnerability to sea-level rise. 
Given that sea level could 
increase by one metre or more 
during this century, the team 
estimates that 3.7 million 
people live within one vertical 
metre of local mean high tide. 
Environ. Res. Lett. 7,014032; 
014033 (2012) 
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No faster than light 


Neutrinos don't travel faster 
than light, according to results 
from the ICARUS experiment 
at the Gran Sasso National 
Laboratory near LAquila, 

Italy. The experiment found 

no light-busting speed when 

it timed 7 neutrinos as they 
travelled about 730 kilometres 
from CERN, Europe’ particle- 
physics lab near Geneva, 
Switzerland — contradicting 
last September's results from 
the OPERA experiment, also at 
the Gran Sasso lab. OPERA had 
recorded neutrinos travelling 
faster than light, but has since 
uncovered possible problems 
with its measurement. The 
ICARUS report was posted to 
the preprint server arXiv.org on 
15 March (M. Antonello et al. 
http://arxiv.org/abs/1203.3433; 
2012). See go.nature.com/ 
go4247 for more. 


Early drug screening 
The newest centre at the US 
National Institutes of Health 

— the National Center for 
Advancing Translational 
Sciences (NCATS) in Bethesda, 
Maryland, which aims to speed 
new drugs to market — has 
announced its first major deal. 
NCATS will partner with 

drug firm Eli Lilly, based in 
Indianapolis, Indiana, to screen 
around 3,800 approved and 
investigational medicines to 
find potential new uses. The 
two organizations announced 
on 13 March that the screening 
will take place over the next 
12-18 months and the results 
will be posted publicly at 
http://tripod.nih.gov/npce. 

See go.nature.com/rgudpy 

for more. 


LHC restarts 


The world’s most powerful 
particle accelerator has begun 
warm-up stretches for its 2012 
run. On 14 March, beams 

of protons again began to 


mS 


Scientists join protest over Spanish cuts 


Scientists have warned Spain's government 
that its planned budget cuts and hiring freeze 
could be fatal for national research. Along 
with widespread general protests (pictured, 
in Madrid) this month against budget cuts 
and labour reforms, more than 36,000 people 
(including three Nobel laureates) and 

65 organizations have signed an ‘open letter 


circulate in the Large Hadron 
Collider (LHC) at CERN, 
Europe's particle-physics lab 
near Geneva, Switzerland. 
Full-scale collisions at 

8 teraelectronvolts should 
begin in April, and scientists 
could confirm the detection 
of the long-sought-after 
Higgs particle by the end of 
the year. 


Children’s study row 
Two advisers to the US 
National Children’s Study 

— which aims to track 
environmental influences on 
the health of 100,000 children 
— have resigned in less than 

a fortnight. Ellen Silbergeld, 
an environmental scientist at 
Johns Hopkins Bloomberg 
School of Public Health in 
Baltimore, Maryland, resigned 
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on 5 March, followed on 

16 March by biostatistician 
Jonas Ellenberg of the 
University of Pennsylvania's 
Perelman School of Medicine 
in Philadelphia. They 

and other scientists are 
complaining about an abrupt 
change in sampling design, in 
which the study will recruit 
subjects from the offices of 
physicians rather than taking 
a representative sample of the 
US population. See go.nature. 
com/wb9sh8 for more. 


Biomedical awards 
The Howard Hughes Medical 
Institute in Chevy Chase, 
Maryland, is investing 
US$200 million to fund up to 
30 US biomedical researchers 
for five years. The 15 March 
announcement marks the first 
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for science in Spain; which was presented to 
members of parliament at the Spanish Royal 
Academy of Sciences in Madrid on 16 March. 
Research minister Carmen Vela says that the 
government is planning to cut €743 million 
(US$978 million) from the science budget, a fall 
of 9% from last year; a draft budget is due on 

30 March. See go.nature.com/hcarmz for more. 


time the institute has held a 
general competition for its 
coveted investigator awards 
since 2007. See go.nature.com/ 
j2a3xt for more. 


| _BUSINESS 
Academic pharma 


In the latest union between 
academia and industry, drug 
giant Merck has announced 
that it will create a non- 

profit research entity called 
the California Institute for 
Biomedical Research (Calibr), 
to be located in San Diego, 
California. The institute plans 
to hire about 150 scientists 
and will be headed by chemist 
and serial entrepreneur Peter 
Schultz, currently at the 
Scripps Research Institute 

in La Jolla, California. 
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Academics from around 

the world will be able to 
submit proposals for research 
at Calibr. See go.nature. 
com/26fij7 for more. 


UK venture capital 
The Wellcome Trust, 

the world’s largest non- 
governmental funder 

of biomedical research, 
announced on 20 March that 
it would launch a £200-million 
(US$371-million) business 
to invest in health-care and 
life-sciences companies. 

The charity — which awards 
around £600 million in 
research grants each year 

— says that in the past it has 
invested in biotech companies, 
and that £148 million of 

£2.7 billion it invested in 
venture-capital firms in the 
year to 30 September 2011 
already goes to life-sciences 
and health-care businesses. 
But the new fund marks its 
largest direct effort to invest 
in biotech start-ups. 


POLICY 


Rare-earth trade 
The United States, Europe 
and Japan have jointly 
complained to the World 
Trade Organization that China 
is restricting exports of rare- 
earth elements. The elements 
are quite plentiful, but miners 
in China (pictured, smelting 
lanthanum) produce more 
than 90% of the supply. They 


TREND WATCH 


Junior researchers are being 
squeezed out of Japanese 
universities by government 
policies that have trimmed 
hiring. The claim is made in a 
draft report on Japan's science 


and technology activities from 


the Council for Science and 


Technology Policy, the nation’s 
top advisory body on science. 

It points out that the number of 
faculty members under 35 has 
plunged from more than 10,000 
to 6,800 in the past 30 years (see 
chart). See go.nature.com/bidelg 
for more. 


are used as catalysts and 

in high-tech magnets, car 
batteries, wind turbines and 
mobile phones. The formal 
complaint — which could 
result in sanctions — was 
made on 13 March, and 
follows years of disputes about 
China's export quotas. 


Misconduct in China 


China's education ministry 

has launched an effort to 

curb scientific misconduct 

at universities. Rules 
announced on 14 March 
require universities to create 
mechanisms for inspecting 
original data, and for getting 
departing scientists to hand 
over materials produced in 
experiments. Universities must 
also create committees with five 
or more experts to investigate 
academic misconduct, and 
arrange ethics training. Various 
Chinese organizations have 
previously issued guidelines 

to stem scientific misconduct, 
but many scientists complain 
that the measures are not 
implemented and cheaters 
remain unpunished. A few 


GOING GREY 


universities, notably Zhejiang 
University in Hangzhou, 
have stepped up efforts to be 
stringent (see Nature 481, 
134-136; 2012). 


SESAME funding 

A US$110-million 
synchrotron is still on track for 
construction near Amman, 
Jordan, despite political 
upheaval in the Middle East. 
Iran and Israel announced 

on 13 March that they had 
joined Jordan and Turkey in 
committing funding to the 
project, known as SESAME 
(Synchrotron-light for 
Experimental Science and 
Applications in the Middle 
East). Partners have now 
promised $20 million in total, 
and SESAME is looking for 
$15 million more to complete 
its first four beamlines by 
2015. See page 385 for more. 


Animal shipments 
UK scientists warned last week 
that pressure from animal- 
rights activists has led all ferry 
companies and many airlines 
to stop transporting research 
animals (including mice) into 
the United Kingdom. Less 
than 1% of UK experiments 
use animals from abroad. 
Science minister David Willetts 
said that the government is 
working with airlines and ferry 
companies to try to tackle the 
problem, perhaps by getting 
researchers and industry to 
agree on codes of conduct for 


Despite a growth in the number of university faculty members 
in Japan, there are fewer opportunities for young researchers. 
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SEVEN DAYS [IMs 
COMING UP 


25-29 MARCH 

The ‘chemistry of life’ — 
including topics such as 
the biochemical basis of 
epigenetics — is the key 
topic at the American 
Chemical Society's 
spring meeting in 

San Diego, California. 
go.nature.com/ysth8 


26-29 MARCH 
‘Planet under Pressure’ 
will see scientists and 
policy-makers meet 

in London to discuss 
the latest research 

on ecosystems and 
planetary stewardship. 
go.nature.com/xtqrnh 


28 MARCH 

Swiss drug firm 
Novartis’ six-year 
legal fight with India’s 
government, which 
began when the firm 
was denied a patent on 
acancer drug, goes to 
India’s supreme court. 
The ruling could set 
precedents for patented 
drugs in India and many 
developing countries. 
go.nature.com/aliugr 


animal transportation. For 
more on the issue of airlines 
and primate transport, see 
pages 373 and 381. 


Indian budget 


Researchers in India were 

not impressed by a moderate 
increase in the country’s 
science budget, announced on 
16 March. Rises varied between 
2% and 18% for scientific 
departments, but, with 
inflation running at 7%, there 
will be little extra real spending 
power — even though Prime 
Minister Manmohan Singh had 
promised in January that there 
would be a “major increase” 

in research spending. See 

page 384 for more. 
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NEWSIN FOCUS 


Opioid Giving Teaming ’ Joe Thornton, the 
receptors revealed in double researchers the right tools for mice and men for better man who resurrects 
exposure p.383 the job p.388 clinical trials p.389 extinct proteins p.390 


BY MEREDITH WADMAN 


ach year, thousands of macaques and 
Be: monkeys are flown into Europe 
and North America to supply academic 
and industrial research labs — more than 18,000 
to the United States in 2011 alone. But ina 
campaign that could affect scientists across the 


CHRISTOPHER J. MORRIS/AP 


7 4 West, the few major air carriers that still trans- 
’ 


Fe 


port non-human primates are coming under 
unprecedented pressure to halt the practice. 
One key route under threat is from China, 
which last year shipped more than 70% of the 
research primates sent to the United States (see 
‘Up in the air’). On 14 March, animal breeders 
in China met with officials of China Southern 
Airlines to implore them to resume flights into 
Los Angeles International Airport, the largest 
US port of entry for research primates. Last 
August, China Southern cancelled a shipment 
from Guangzhou of 80 crab-eating macaques 
(Macaca fascicularis) destined for Los Angeles, 
after a social media, e-mail and telephone cam- 
paign by pressure group People for the Ethical 
Treatment of Animals (PETA), based in Nor- 
folk, Virginia. China Southern has not flown 
research primates into Los Angeles since then. 
“This was part of our larger campaign to dis- 
rupt the flow of primates to US labs,’ says Justin 
Goodman, associate director of the laboratory 
investigations department at PETA in Washing- 
ton DC. The group complains that imports are 
supplementing an already-burgeoning primate 
population in US labs (see ‘Frequent flyers’). 
Two other airlines are also in the public 
spotlight. Air France faces mounting pressure 
as the last major European carrier to transport 
research primates. And Air Canada is petition- 
ing the Canadian Transportation Agency, the 
body that regulates Canadian air carriers, for 
Protesters make their mark at an Animal Transportation Association meeting in Vancouveron18 March. _ the required permission to refuse to transport 
research primates in the future. With news 
breaking last week that ferry companies have 
entirely ceased transporting all research ani- 


e e mals — including sophisticated mouse disease 
C IV1S S [! oun models — into the United Kingdom, research- 
ers fear that this is the start ofa larger trend. 


Michael Hsu, president of animal-breeding 


e e company Shared Enterprises in Richlandtown, 
i 1mMma e | S Pennsylvania, says that activism is affect- 
ing companies such as his, which maintains 


a macaque breeding colony in Shanghai and 
relies on flights into Los Angeles to get its ani- 


Sup p ly of research monkeys to Western labs under threat mals to its clients. China Southern, he says, was 
as airlines react to animal-rights campaign. easily the biggest carrier serving Los Angeles 


"ANIMAL RESEARCH | 


22 MARCH 2012 | VOL 483 | NATURE | 381 
© 2012 Macmillan Publishers Limited. All rights reserved 


SOURCE: PETA/FWS 


SOURCE: AAVS/FWS/USDA 


| NEWS | IN FOCUS 


UP IN THE AIR 


In 2011, some 97% of the 18,044 non-human primates imported into the United States for use in 
research came from a handful of countries. Airlines are facing pressure to stop such shipments. 
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> airport. The other Chinese carriers serving 
the United States — Air China and China East- 
ern — have lower capacity, creating a bottle- 
neck for breeders lining up to ship animals. The 
situation, says Hsu, “is causing a big backlog. It's 
costing us money. We've been producing these 
animals for these markets, and if we can't ship 
them, it’s very difficult for us to sustain them? 

Breeding the animals in the United States 
instead would be problematic: infrastructure 
and labour costs are much higher than they 
are in Asia, and colonies are much more likely 
to become the targets of animal activists. And 
moving the animals by sea is a non-starter 
because of the deleterious effects of the six-week 
trans-Pacific journey on the animals’ health. 

Martin Paré, an associate professor at the 
Centre for Neuroscience Studies at Queen’s 
University in Kingston, Ontario, worries not 
only about his own access to primates, but 
about a slippery-slope effect. “If Air Canada 
was permitted to stop transporting non- 
human primates, what would be next? Stop- 
ping all animal transportation for research, 
which is what’s happening in the UK?” 


FREQUENT FLYERS 


The United States has increased imports of non-human 
primates (NHPs) over the past decade, supplementing 
a growing lab population. 
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Many airlines, including Lufthansa, Brit- 
ish Airways and Virgin Atlantic, already 
refuse to carry research primates. In fact, the 
China Southern shutdown at Los Angeles 
resulted in part from PETA putting pressure 
on Lufthansa, which unloads planes for China 
Southern at the airport. Activists pointed out 
to Lufthansa that unloading research primates 
for the Chinese carrier was at odds with its pol- 
icy on the transport of non-human primates. 
Within 24 hours, the China Southern macaque 
shipment was cancelled. Goodman speculates 
that “Lufthansa was anticipating a really bad 
PR situation”. Lufthansa says that it cannot 
comment on matters involving another carrier. 

At the time that it merged with United Air- 
lines in 2010, Continental Airlines was the last 
remaining US carrier that still transported 
research primates. United is preparing to pub- 
lish a policy for the merged airline later this year. 

“Nearly every major airline rejects the trans- 
port of primates to laboratories,” says Good- 
man. “The handful that are still doing it are 
facing more intense opposition and scrutiny 
than they ever have before. Because it’s not in 
their best interest to continue doing so, Iam 
sure that they will stop.” 


SEVERED LINKS 
The next several months could prove crucial. 
After last week’s meeting, China Southern is 
still deciding whether it will resume trans- 
porting research primates through Los Ange- 
les. If Air Canada’s petition to stop shipping 
primates is successful, Canadian researchers 
would be inconvenienced, but not entirely 
cut off — around 1,700 non-human primates 
were imported from the United States last year, 
mostly driven across the border. The Public 
Health Agency of Canada and Queen’s Uni- 
versity are challenging the airline's move, and 
a decision from the transportation agency is 
expected later this year. 

“The research that Queen’s University per- 
forms has significant benefits for the health 
and welfare of Canadians,” says Steven Liss, 
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the university's vice-principal for research. He 
adds that the humane treatment of the animals 
is “always” the university’s priority. “Air travel 
is simply the fastest and least stressful way of 
transporting them” 

Air France issued a statement last Septem- 
ber declaring that “nothing allows the airline 
to decide on the merits of the use of animals in 
biomedical research’, which it called “impor- 
tant and scientifically justified”. Nonetheless, 
after it was bombarded with PETA-orches- 
trated e-mails, phone calls and Facebook 
postings, the airline cancelled a scheduled 
1 February shipment from Mauritius of 
60 monkeys bound for a private laboratory in 
Everett, Washington. 

The apparent success of the public campaign 
has alarmed breeders, importers and biomedi- 
cal researchers. “It’s unfortunate that some 
airlines have chosen to capitulate to a small 
number of individuals with an agenda who 
arent truly representative of the general pub- 
lic” says Matthew Bailey, vice-president of the 
National Association for Biomedical Research 
in Washington DC, a trade organization that 
lobbies on lab-animal policy for drug firms 
and research universities. 

Tipu Aziz, a professor of neurosurgery at the 
University of Oxford, UK, has used macaques 

to study Parkinsons 


“Moreand more disease. He believes 
scientists will that restricting trans- 
go elsewhere port of animals will 
to do primate not have the desired 
research.” effect. “My gut feel- 


ing is that more and 
more scientists will go elsewhere to do pri- 
mate research,” he says. “I have no qualms 
about going abroad to do my work. There are 
quite a few countries that have good facilities: 
there are centres in India, Singapore, Malaysia, 
China.” 

Others say that the drug industry — a 
major user of research primates — may also 
be inclined to relocate activities if the flight 
barrier becomes insurmountable. 

“Let’s say [the activists] get their wish and 
no animal comes into the United States,” Hsu 
argues. “Merck is not going to say: ‘Okay, fine? 
They are going to go to other countries where 
animal care might not be as good, and start 
doing research there,’ 

Still, expert observers say that the pres- 
sure on the airlines is significant. “The public 
tide is turning against the use of non-human 
primates in general, and researchers must 
either find alternatives or convince the public 
there is a compelling reason to continue such 
research,’ says Paul Root Wolpe, director of 
the Center for Ethics at Emory University in 
Atlanta, Georgia, which is home to the Yerkes 
National Primate Research Center. If scientists 
don’t make a louder, more persuasive public 
case, he says, “more companies — not only 
in transportation — will choose the side of 
animal-rights advocates”. m SEE EDITORIAL P.373 
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Opioid receptors revealed 


Two more structures join the parade of once-intractable proteins. 


BY LIZZIE BUCHEN 


na small table in his office at the 
() Scripps Research Institute in La Jolla, 

California, Ray Stevens spreads 
out a sheet of paper covered with colourful 
branched lines, each sprouting and thinning 
before terminating in an esoteric code. 
“This is the dream,” he declares. 

The intricate diagram repre- 
sents the largest family of receptor 
proteins encoded in the genome 
— the G-protein-coupled recep- 
tors (GPCRs), ubiquitous cell- 
surface molecules that are 
activated by light, odours, 
hormones and neurotrans- 
mitters. Stevens wants to deter- 
mine the atomic structures of 
receptors on all branches of 
the tree. This week, that goal 
moved two receptors closer: 
Stevens's group has solved the 
atomic structure of the x-opioid 
receptor (k-OR)', and a team led by Brian 
Kobilka at Stanford University in California has 
solved the medically crucial u-opioid receptor 
(u-OR)’. The structures, published in Nature, 
bring the tally of GPCR structures solved this 
year alone up to five. 

Ten years ago, Stevens's dream would have 
seemed a hopeless fantasy — the complex, 
dynamic molecules had confounded relentless 
attempts to expose their atomic secrets by X-ray 
crystallography. But in 2007, several techno- 
logical breakthroughs brought the drought to 
an end’, and the structures began to trickle out, 
with one or two solved nearly every year. Now, 
it seems the flood is approaching. Stevens's and 
Kobilka’s groups each have another structure 
ready to go, on top of the five GPCRs they have 
already published this year*®. Stevens predicts 
several more from his lab by the end of the year. 

The opioid receptors are hot targets. Opioid 
drugs, including morphine, codeine and oxy- 
codone, are invaluable in medicine for their 
powerful painkilling and sedative effects. But 
they are also highly addictive, an enormous 
liability that limits their medical use. Their 
ability to produce a strong sense of eupho- 
ria also leads to rampant recreational use of 
prescription drugs and their more potent 
and deadly derivative, heroin. The majority 
of their effects, including analgesia, eupho- 
ria and sedation, are mediated by the u-OR, 
although most opioid drugs are promiscuous, 
activating x-OR and other opioid receptors as 


well. Understanding the molecular structure of 
these receptors is crucial for designing drugs 
that exert the beneficial actions of opioids but 
lack the unwanted side-effects. 

But opioid receptors, like other GPCRs, 
had long dodged the scrutiny of X-ray crys- 
tallography. The technique requires relatively 


When crystallized with a ligand (yellow), p-opioid 
receptors form an intimate pair. 


large amounts of purified protein; for proteins 
nestled in cell membranes, such as GPCRs, 
that means carefully extracting the molecules 
from their fatty environment while maintain- 
ing their delicate shapes. Identical copies of 
the protein must then be packed so tightly 
that they form a crystal. This step is particu- 
larly difficult for GPCRs, which have floppy 
loops that are resistant to staying in place and 
conforming to a regular crystal lattice. 

But one by one, the bottlenecks were 
circumvented. Kobilka, who had worked on 
crystallizing the B,-adrenergic receptor (6, AR) 
for more than 20 years, discovered that he could 
quiet the most restive loop by replacing it with a 
protein called T4L. In 2007, he and Stevens both 
crystallized the 8, AR using T4L and a medium 
called the lipidic cubic phase”®. Since then, all 
but one of the solved GPCRs have been cracked 
using this approach. 

The w-OR and the x-OR structures reveal 
that the binding pockets of both receptors 
are large and gaping, which Stevens suggests 
could explain how the x-OR recognizes such 
a diverse set of molecules. Kobilka and his 
colleague Sébastien Granier, now a molecu- 
lar pharmacologist at the French National 
Institute of Health and Medical Research in 
Montpellier, say that the openness of the n-OR 


binding site may also explain why the actions 
of opioids can turn on so quickly and be 
rapidly reversed. Heroin overdoses, for 
example, can be quickly treated with opioid 
antagonists such as naloxone. 

Kobilka and his colleagues also discovered 
that the p-OR crystallized in tightly linked 
pairs, complementing several other lines of 
evidence that the receptor pairs with itself 
and with other opioid receptors in the 
body. These pairings may mediate the 
various effects of opioid drugs. 
Philip Portoghese at the University 
of Minnesota in Minneapolis, who 

has been studying opioid recep- 
tors since the 1960s, is thrilled 
with the structures. “When I 
first started in the opioid field, 
crystallographic determina- 
tion of any receptor was viewed 
as science fiction,” he says. “Never in 
my wildest dreams did I think I would 
live to see the coordinates of not one, 
but two opioid receptors.” Portoghese 
points out, however, that both recep- 
tors were crystallized in their inactivated 
states, which he says may limit the value of the 
structures in the hunt for safer painkillers. 

Nor are crystal structures the be-all and 
end-all for rational drug design. GPCRs are 
not static; they writhe and twist, and crystal 
structures dont capture this crucial dynamism, 
which may be critical for designing drugs that 
can join in this dance. Kobilka, Stevens and 
many others are complementing crystallogra- 
phy with techniques that can capture protein 
movement, such as nuclear magnetic resonance 
and molecular-dynamics modelling. 

On Stevens's website, where the tree of 
dreams is prominently displayed, 14 victori- 
ous red flags mark solved GPCRs, and more 
will follow very soon. It took at least 20 years 
of dogged devotion to solve the B, AR — but 
today, when someone visits Stevens’s lab to 
learn about cystallography, he has them crys- 
tallize the once-uncrystallizable protein as an 
introductory exercise. It takes three days. = 
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Finance minister Pranab Mukherjee (centre, with other ministers) didn’t deliver hoped-for increases. 


Indian budget 
disappoints 


Downturn finally ends run of increases for science. 


BY K. S. JAYARAMAN IN BANGALORE 


funding — notching up annual increases of 

about 25% over the past five years — seems 
finally to have been hit by the global economic 
downturn. 

Last week, the government's budget for 2012- 
13 promised increases of between 2% and 18% 
for most of its scientific departments, disap- 
pointing researchers who note that the country’s 
7% inflation rate will consume most of the extra 
cash (see table). An apparent increase of 52% in 
the space-science budget was actually caused by 
a roll-over of unused funds from last year. 

Researchers had been expecting a major 
funding boost this year, after assurances from 
Prime Minister Manmohan Singh. In a speech 
at the annual Indian Science Congress in Bhu- 
baneswar in January (see Nature http://dx.doi. 
org/10.1038/nature.2012.9750; 2012), Singh 
lamented that China's progress in science 
has overshadowed India’s, and he promised 
a “major increase” in research and develop- 
ment spending, from the current 1% of the 


[ist remarkable growth in science 


gross domestic product to at least 2% by 2017. 
He also backed plans from the Department of 
Science and Technology (DST) for a 50-billion- 
rupee (US$1-billion) supercomputing facility 
in Bangalore; a 13.5-billion-rupee neutrino 
observatory near Pudukkottai in Theni district, 
Tamil Nadu; and a 3.5-billion-rupee project to 
improve monsoon forecasting. 

But on 16 March, finance minister Pranab 
Mukherjee'’s budget dashed scientists’ hopes. The 
country’s economic growth has slowed over the 


past year, prompting a cautious budget. “Less 
funding for science agencies will mean fewer 
research grants for university scientists,” says 
Ajay Sood, a physicist at the Indian Institute 
of Science in Bangalore, and president of the 
Bangalore-based Indian Academy of Sciences. 

Vishwa Mohan Katoch, secretary for the 
Department of Health Research, says that 
although he had requested 30 billion rupees for 
2012-13, his department had been allocated 
only 9 billion, an 18% increase on last year. 
Kanikaran Satyanarayana, deputy chief of the 
Indian Council of Medical Research, says that 
asa result, plans to increase funding for virology 
laboratories and establish a nationwide disease- 
surveillance network will have to be put on hold. 

“If we want to bring in quality changes 
we need big money,’ says Katoch. “But what 
can the government do if they do not have 
resources?” 

Thirumalachari Ramasami, secretary for 
the DST, is confident that the relative austerity 
of this year’s budget will not set a precedent. 
“Tam optimistic that funding will be restored 
[in 2013] once the 12th five-year plan, for 
2012-17, is finalized by the planning commis- 
sion,” he says. That plan is expected by next 
month. Ramasami adds that the big DST pro- 
jects outlined in Singh’s speech should not be 
affected by the disappointing increases. 

There is ample evidence that India’s previ- 
ous budgets have helped to advance science in 
the country, says Lingadahalli Shashidhara, a 
developmental biologist at the Indian Institute 
of Science Education and Research in Pune. “We 
have quadrupled the number of research faculty 
members in just ten years,’ he says. “The quality 
of projects coming to funding agencies is going 
up, and India is publishing research papers in 
high-impact journals more frequently.’ 

But some researchers feel that Indian science 
needs reform much more than it needs extra 
cash. Rajesh Kochhar, a historian of science 
at the Indian Institute of Science Education 
and Research in Mohali, says that research 
institutions must be more strictly audited to 
divert funding away from poorly performing 
centres and give more to the most productive 
researchers. Increased funding cannot help to 
advance science “unless the weeds are ruth- 
lessly removed so that the flowering plants can 
flourish’, he says. = 


DISCOURAGING RESULT 
Budget allocations in billions of rupees. 

2011-12 2012-13 % increase 
Atomic energy 67.24 76.51 13.8 
Department of Health Research Tigh 9.08 lg) 
Department of Science and Technology 26.06 28.73 10.3 
Department of Scientific and Industrial Research | 32.17 34.84 8.3 
Department of Biotechnology ieh7/7/ 15.00 9.0 
Department of Agricultural Research 50.08 53:92 yi 
Oceanographic research 6.99 7.13 18 
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Clashing nations back SESAME 


Participants put politics aside to find cash for construction of Middle-East synchrotron. 


BY GEOFF BRUMFIEL 


but they have found common ground in a 
scientific project. 

Despite brinkmanship over Iran’s nuclear 
programme, the two nations have joined with 
Jordan and Turkey to commit crucial funding 
to the construction of Synchrotron-light for 
Experimental Science and Applications in the 
Middle East (SESAME). Under an agreement 
announced on 13 March, the partners have 
promised a total of US$20 million towards the 
construction of the project. SESAME is now 
looking for a further $15 million from other 
partner nations to complete the synchrotron’s 
first four beamlines in 2015. 

“The fact that we could get such an agree- 
ment given the political situation is kind of 
remarkable,’ says Chris Llewellyn Smith, the 
president of SESAME’s governing council and 
a past director-general of CERN, Europe's 
particle-physics lab near Geneva, Switzerland. 


[= and Iran can agree on little these days, 


Based in Allan, outside Amman, Jordan, 
SESAME aims to build scientific expertise in 
its member countries and bolster cultural ties 
between them. When completed, the $110-mil- 
lion machine would be the Middle East's only 
source of high-intensity synchrotron X-rays, 
which are used in fundamental research rang- 
ing from molecular biology to materials science. 

The project mirrors the organization of 
CERN, but Amman is along way from Geneva 
— metaphorically as well as literally. In recent 
years, SESAME has had to contend with the 
assassination of two members of its Iranian 
delegation and huge political upheavals in 
partner countries including Egypt. 

Despite the tensions, scientists involved 
with the project say that SESAME is succeed- 
ing because it has something to offer all the 
participants. Israel, for example, gets a world- 
class research facility on its doorstep and an 
opportunity to show its willingness to collabo- 
rate with neighbours, says Eliezer Rabinovici, 
director of the Institute for Advanced Studies 


at the Hebrew University of Jerusalem, anda 
member of SESAME'’s council. Other member 
states see an opportunity to educate a young 
and rapidly growing science base. “Turkey is 
also interested in developing similar laborato- 
ries and SESAME would bea very good experi- 
ence, says Zehra Sayers, a physicist at Sabanc1 
University in Istanbul and head of SESAME’s 
scientific advisory committee. 

Llewellyn Smith is optimistic that the 
remaining money can be found. Now that local 
partners have committed funding, the project is 
more likely to gain support from outsiders such 
as the United States and the European Union. 
And other partners in the Middle East, notably 
Egypt, may yet provide cash, although political 
turmoil could make commitments difficult. 

Rabinovici says that SESAME shows the 
strength of scientific collaboration, even under 
the most difficult circumstances. “This is an 
existence proof that scientists can, at least for 
a period of time, transcend the politics around 
them,” he says. = 
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DNA donor rights affirmed 


NIH committee urges that genome study subjects be told of medically relevant results. 


BY ERIKA CHECK HAYDEN 


a subject’s DNA is collected for one study, 

deposited in a database or biobank and 
then analysed by other researchers for sepa- 
rate studies. But what happens when 
a later study stumbles on something 
that could be of significance for the 
donor, such as an allele for familial 
hypercholesterolaemia — a treatable 
genetic disorder that causes pro- 
gressive atherosclerosis — or some 
other health-related variation? Do 
researchers conducting secondary 
studies and biobanks have a duty 
to share such revelations with the 
original research subjects? 

They do, when possible, accord- 
ing toa detailed consensus statement 
from a working group funded by 
the US National Institutes of Health 
(NIH) in Bethesda, Maryland, and 
published this week (S. M. Wolf et al. 
Gen. Med. http://dx.doi.org/10.1038/ 
gim.2012.23; 2012). The statement’s 26 signa- 
tories consulted dozens of other researchers 
and biobank managers over a two-year period. 
They conclude that biobanks “shoulder signifi- 
cant responsibility” for addressing how to deal 
with ‘incidental findings’ — those research 
results that could have medical consequences 
for the donors of genetic material. 

Genetics researchers are divided on the 
matter of incidental findings. Conventional 
research ethics holds that participants should 
not be told of their individual results, to keep 
them from expecting to benefit person- 
ally from the study. What’s more, reporting 
such findings can be a logistical challenge 
because many studies strip identifying 
information from donated samples. But, 
increasingly, geneticists are embracing the idea 
that research participants have a right to know 
of any unwelcome surprises in their genome. “If 


> 


MORE 
ONLINE 


| is a familiar scenario in genetic research: 


One researcher’s 
genome shows 
personalized 


we really believe this is medically valuable and 
useful data, then we have to act on it? says Leslie 
Biesecker of the US National Human Genome 
Research Institute in Bethesda, who contributed 
to the discussions that led up to the consensus 
statement but is nota signatory. 


Genetic testing is increasingly coming up with ‘incidental findings’. 


The need to establish policies for the return 
of results has grown with the proliferation of 
whole-genome sequencing, says James Evans, 
editor-in-chief of Genetics in Medicine, which 
is publishing an entire issue on the return of 
results in genetic research, along with the 
consensus statement. 

In the past, says Evans, medically signifi- 
cant incidental findings were an exception 
in genetic studies, but “when you do whole- 
genome sequencing you will regularly come up 
with findings that have some medical import”. 
Yet less than half of US biobanks return results 
to participants, according to a study done 
for the working group by Mao Thao of the 
University of Minnesota in Twin Cities and 
colleagues. The working group, coordinated 
by Susan Wolf of the University of Minnesota 
Law School in Minneapolis, urges biobanks to 
take the lead in changing that ethos. The group 
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recommends that each biobank sets up a com- 
mittee to oversee the return of results and also 
that a single central advisory body be created 
that would foster consistency among biobank 
research systems. Wolf led a previous NIH 
working group that in 2008 published recom- 
mendations proposing that primary 
researchers — those responsible for 
collecting data — should report some 
incidental findings back to research 
participants (S. M. Wolf et al. J. Law 
Med. Ethics 36, 219-248; 2008). 

But some researchers warn that 
keeping track of incidental results and 
re-identifying participants so that 
they can be informed could prove 
costly and pose ethical and legal dif- 
ficulties. “It’s unfortunate that the 
authors of the consensus statement 
didn’t discuss the cost implications 
of what they’re proposing, because 
what they have in mind is going to 
be expensive and difficult, particu- 
larly at a time when funding success 
is as low as it’s ever been,” says Ellen 
Wright Clayton, a paediatrician and lawyer at 
the Center for Biomedical Ethics and Society at 
Vanderbilt University in Nashville, Tennessee. 

Wolf counters that some biobanks already 
do return results, so it does not need to be 
prohibitively costly or difficult. For instance, 
Biesecker runs a study called ClinSeq, which 
has enrolled around 900 people whose exomes 
(the protein-coding part of the genome) are 
being sequenced. The participants are told 
about any findings that could have implica- 
tions for their health care. Biesecker says 
that there is an increasing tension in the field 
between studies such as his that underscore 
the usefulness of genetic medicine and an 
assumption that it is too costly and difficult to 
use the results of such studies to help improve 
patients’ lives. “I don't think, asa field, that we 
can continue to say both things at the same 
time,’ Biesecker says. m SEE EDITORIALP.373 
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Optical components including modulators, waveguides and photodetectors crowd a silicon chip. 


TECHNOLOGY 


Photonic chips 
made easier 


Shared-production system aids academics and start-ups. 


BY KATHERINE BOURZAC 


more, you will need a faster messenger. That 

is the idea behind silicon photonics, which 
whizzes data around a silicon chip using light 
instead of sluggish old electrons. 

Photons already carry data in fibre-optic 
cables for telecommunications. But using light 
in computing means carving transistors and 
lasers out of the same wafer of silicon, which 
needs an advanced manufacturing ‘foundry. 
Only large companies and a few research insti- 
tutions with deep pockets have these facilities, 
and some researchers worry that lack of access 
is slowing progress in photonics. 

A project called Optoelectronic Systems 
Integration in Silicon (OpSIS) now promises 
a solution. Based at the University of Washing- 
ton in Seattle and at the University of Delaware 
in Newark, OpSIS provides design support 
for researchers in silicon photonics; organizes 
shared production runs; and arranges low-cost 
access to advanced foundries so that academ- 
ics and start-up companies can create proto- 
type devices. The US Air Force is the project’s 
biggest funder, with more support coming 
from Intel, based in Santa Clara, California, 


E you want to speed up computers much 


and design-automation company Mentor 
Graphics in Wilsonville, Oregon. 

OpSIS will make its first batch of silicon 
photonics prototype chips at the Institute of 
Microelectronics’ foundry in Singapore this 
month. Its customers are some 25 groups 
working on high-speed communications 
circuits, portable medical diagnostic devices 
and photonics basic research. 

Users will pay US$1,500 for one square milli- 
metre of wafer space, with projects requiring 
anywhere from 10mm‘ to 40 mm”. Researchers 
going it alone would have to pay for a complete 
production run at the foundry — at a cost many 
times higher. OpSIS offers much lower prices by 
putting many different projects into a single run 
and splitting the cost. 

Michael Hochberg, a photonics researcher at 
the University of Delaware who co-founded and 
leads OpSIS, knows how expensive it can be to 
translate silicon photonics technology from uni- 
versity lab into product. In 2001, he co-founded 
Luxtera, in Carlsbad, 
California, the first com- 
pany to offer integrated 
electronics and optics 
on chips, which are now 
used commercially in 
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data centres and supercomputers. Luxtera spent 2 
more than $100 million to develop and produce < 
its first product, released in 2010. “I want to 2 
make it possible to do what Luxtera did for an 
order of magnitude less cost,” says Hochberg. 
OpSIS will soon also have access to foundries 
owned by Luxtera and by UK aerospace com- 
pany BAE Systems, headquartered in London. 

The history of the silicon chip suggests that 
OpSIS could be hugely influential. Its shared- 
production model is based on the University of 
Southern California's Metal Oxide Semiconduc- 
tor Implementation Service (MOSIS), which 
has been operating since 1981 in Marina del 
Rey and has helped the semiconductor indus- 
try to grow rapidly. Electronics pioneers, includ- 
ing Carver Mead at the California Institute of 
Technology in Pasadena, pushed for academics 
and small companies to gain access to fabrica- 
tion facilities (‘fabs’) to build the integrated cir- 
cuits they needed to test their ideas and train 
students. At the time, academics were restricted 
to making simple devices in the lab, doing theo- 
retical work or — as Mead did — having friends 
at Intel sneak their students’ designs into pro- 
duction runs. “You have to have access to a fab 
to build the real thing? says Mead. 

Silicon photonics now is at a similar stage 
to the electronics industry in the early 1980s, 
says Mario Paniccia, director of the photon- 
ics technology lab at Intel. Today’s photonics 
researchers have better access to foundries 
than Mead did decades ago, but most are still 
excluded from the cutting-edge facilities that 
can make more complicated devices. 

A data-processing photonic chip, for exam- 
ple, must integrate lasers, waveguides and 
electrical transistors in complex systems that 
are completely reliable. Making such devices 
in advanced fabrication facilities ensures that 
they are of acceptable quality, says OpSIS user 
S. J. Ben Yoo, director of the Center for Infor- 
mation Technology Research in the Interest of 
Society at the University of California in Davis. 

A service similar to OpSIS has been offered 
since 2008 by the ePIXfab initiative, funded in 
part by the European Union. But OpSIS has 
access to more advanced foundries, says Lukas 
Chrostowski, an electrical engineer at the 
University of British Columbia in Vancouver, 
Canada. 

Chrostowski has 20 designs in the first 
OpSIS run. Several are prototype versions of 
optical sensors for a hand-held medical-diag- 
nostic device that could analyse blood samples 
to detect disease biomarkers. Previous versions 
of this system were far from being hand-held: 
Chrostowski had to attach an external laser 
and a detector to an optical chip he made using 
ePIXfab. Through OpSIS, he will now be able 
to integrate all the building blocks onto one 
chip in a more realistic prototype. 

This sort of innovation cannot be left only 
to well funded companies, says Paniccia. “We 
need more people experimenting, learning, 
and innovating with silicon photonics.” m 
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Mice guide human drug trial 


Parallel approach to cancer study provides genetic insights. 


BY HEIDI LEDFORD 


he mouse hospital at the Dana-Farber 
| Cancer Institute in Boston, Massachu- 
setts, boasts resources rivalling those 
offered to human patients, including state-of- 
the-art medical-imaging facilities, round-the- 
clock nursing staff and access to cutting-edge 
cancer drugs. The goal is not to cure mice, of 
course, but to gain insight into clinical trials of 
those drugs in humans. 

The roughly 5,000 rodent residents at Dana- 
Farber are pioneers in an approach known asa 
co-clinical trial, in which a human clinical trial 
is partnered with animal studies designed to 
reveal clues about why some patients respond 
to a treatment and others fail to do so. The 
first findings from the Dana-Farber project, 
published online on 18 March (Z. Chen et al. 
Nature http://doi.org/hrr; 2012), suggest that 
adding an experimental drug called selu- 
metinib to standard chemotherapy may help 
to shrink lung tumours in patients with muta- 
tions in a gene called KRAS. The parallel study 
in the mouse revealed, however, that the drug 
may fail in those who also carry a mutation in a 
tumour-suppressor gene called LKB1. 

“Normally, very little scientific interrogation 
occurs during the conduct of a human cancer 
trial? says David Tuveson, a cancer researcher 
at the University of Cambridge, UK, who was 
not involved in the study. “This is a logical and 
badly needed path forward.” 

It is standard for new drugs to undergo 
animal studies before they reach the clinic. 
But early-stage animal trials can be mis- 
leading because they often fail to mimic the 
conditions of subsequent human trials, says 
Andrew Kung, an oncologist at Dana-Farber 
and an author on the mouse study. By contrast, 
a co-clinical trial is designed to closely match 
a parallel trial in human subjects. 

Selumetinib, developed by Array BioPharma 
of Boulder, Colorado, and London-based 
AstraZeneca, is a member ofa hotly pursued 
class of cancer drug that inhibits a crucial bio- 
chemical step promoting tumour-cell prolif- 
eration and survival. Full results of the human 
trial, which enrolled 87 patients with KRAS 
mutations, have yet to be published. Array Bio- 
Pharma announced last year that, when given 
with standard chemo- 


therapy, the drug slowed NATURE.COM 
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A genetically engineered mouse lung-cancer model offers clues to its human analogue. 


Like the human subjects, the hundreds of 
mice used for the co-clinical trial carried a 
mutated form of Kras, but some also contained 
mutations in the tumour-suppressor genes 
p53 and Lkb1. Mice treated with selumetinib 
and standard chemotherapy benefited if they 
had the additional mutation in p53 alone, but 
the Lkb1 mutation seemed to render the drugs 
ineffective. Outcomes of the mouse trials were 
measured using methods similar to those for 
the human subjects, which meant relying on 
non-invasive imaging techniques, rather than 

simply killing the ani- 
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Overall, the mouse 
study suggests that, 
in future trials of 
selumetinib, patients should be screened for 
the presence of LKB1 mutations, says Kung. 
That would allow researchers to determine 
whether a genetically identifiable subset of 
patients resistant to the drug may be affecting 
their results. 

The results in mice also suggest that an 
imaging technique called FDG-PET can 
measure short-term tumour responses to 
selumetinib. The method could be a boon for 
human studies, because patients generally can- 
not be subjected to repeated tumour biopsies. 

The co-clinical approach could speed 
up drug development as oncology increas- 
ingly embraces complex treatment regimens 
involving multi-drug cocktails and tailored 


condition.” 


for specific genotypes, says Hugues de Thé, a 
cancer researcher at the Paris Diderot Univer- 
sity. He has used a similar approach in a trial 
of drugs for a form of leukaemia, but says that 
the new work is a “technical tour de force” 
because of the genetic complexity of the 
tumours studied. 

Many other centres are developing facilities 
for co-clinical trials, but the experiments can 
be prohibitively expensive. The drugs often 
need to be given daily, which requires more 
personnel to handle the animals, and the 
technology used to image tumours in living 
animals can be costly. Mouse hospitals will 
probably have to invest heavily in generating 
trial-specific mouse models of cancer, cautions 
Tuveson, who has been using the approach to 
tackle pancreatic cancer. “You need the infra- 
structure to breed these animals and then to 
follow them just like you would in a hospital, 
and most laboratories do not have that.” 

But funders are aware of the approach’s 
promise. In 2009, the US National Cancer 
Institute in Bethesda, Maryland, awarded 
US$4.2 million to Pier Paolo Pandolfi, a 
cancer researcher at Harvard Medical School 
in Boston who is a co-author on the Nature 
study, to establish the co-clinical trial pro- 
ject in lung and prostate cancer. Additional 
grants from Stand Up To Cancer, a project 
of the Entertainment Industry Foundation 
charity, based in Los Angeles, California, are 
funding co-clinical trials in breast and ovar- 
ian cancer. It is a new way, says Tuveson, “to 
use mice as a looking glass into the human 
condition” = 
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RAISING THE DEAD 


To dissect evolution, Joe Thornton resurrects proteins that have been extinct for 
many millions of years. His findings rebut creationists and challenge polluters. 


alfway through breakfast, 

Joe Thornton gets a call from 
IH his freezer. A local power cut 

has triggered an alarm on the 

—80°C appliance in his lab at 

the University of Oregon in 
Eugene, and it has sent out an automatic call. 
Thornton breaks off our conversation and calls 
his senior research scientist, Jamie Bridgham, 
to make sure that the back-up generator has 
kicked in. If the freezer starts warming up, a lot 
could be lost — not least a valuable collection 
of proteins that had been extinct for hundreds 
of millions of years until Thornton and his 
team brought them back from the dead. 

One deep-frozen vial holds the more-than- 
600-million-year-old ancestor of the receptors 
for oestrogen, cortisol and other hormones, 
which Thornton brought to life’ nine years ago. 
Other tubes house proteins more than 400 mil- 
lion years old, which Thornton resurrected a 
few years later to show how an ancient receptor 
had changed its preferences — and how the 
march of evolution cannot be reversed” *. In 
another corner of the freezer rest the ancient 
protein components ofa sophisticated cellular 
machine that acquired a more complex form 
through random mutations rather than selec- 
tion for superior function, as the group showed 
in Nature this January’. The sheer awe of work- 
ing with long-dead proteins doesn't fade, says 
Thornton. “It’s amazing. The ability to do this 
type of time travel is fantastic.” 

Thornton is a leader ina movement to do for 
proteins what the scientists in Jurassic Park did 
for dinosaurs: bring ancient forms back to life, 
so that they can be studied in the flesh. “Instead 
of passively observing things as most evolu- 
tionary biologists do, you actively go in and test 
the hypotheses experimentally,” says Antony 
Dean, a molecular biologist at the University of 
Minnesota in St Paul who heads another major 
group in the field. “His is one of the leading 
labs, no doubt” And Thornton is tackling some 
important questions, says Kenneth Miller, a 


BY HELEN PEARSON 


molecular biologist at Brown University in 
Providence, Rhode Island. “He’s helping to put 
some flesh on the bones of speculation about 
how complexity arises.” 

What isn’t so widely known is that evolu- 
tionary biology is Thornton's second career: 
in his first, he was an activist for Greenpeace, 
campaigning vigorously against the release of 
toxic chemicals. He wrote a controversial book 
on organochlorines: industrial chemicals that 
include dioxins, polychlorinated biphenyls 
(PCBs) and pesticides such as DDT. That activ- 
ist legacy bleeds into his work today, for example 
in his focus on the oestrogen receptor, which 
is corrupted by many pollutants. The grubby, 
sea-green tiles under Thornton's lab benches 
were carefully sourced to be free of polyvinyl 
chloride (PVC), one of the organochlorines that 
worries him most. His activist past also helps 
to explain why he has been fearless — almost 
enthusiastic — about highlighting the challenge 
that his work presents to a creationist argument 
called intelligent design: the claim that complex 
molecular systems can only have been created 
by a divine force. Thornton shows how evolu- 
tion did the job, leaving no need for a designer. 


ENVIRONMENT TO EVOLUTION 

Thornton says that his activist days — dur- 
ing which he saw that many risk-assessment 
models were shot through with assumptions 
and biases — left him “intensely committed 
to methodological reductionism and experi- 
mentalism’, which he now uses to break evolu- 
tion down into detailed steps that he can test. 
“If you're doing science, I think it ought to be 
as strong and decisive as possible,” he says. “If 
you're doing politics, go ahead, but don't try to 
disguise it as science.” 

Thornton’s unconven- 
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Haven, Connecticut. But the course, with its 
focus on the philosophy of criticism rather than 
literary texts, left him with a hunger for real- 
ity, and nothing seemed more real than politics 
and activism. He dropped out of college, signed 
up with Greenpeace and spent several months 
doorstepping to canvass people for money and 
support. 

In the early 1990s, Greenpeace was cam- 
paigning against sources of toxic pollution, and 
Thornton was drawn in. He became the ‘science 
guy, translating the scientific literature into 
reports and other material that communities 
and Greenpeace could use to make their case. 
“You could rely on Joe when you didn’t have 
enough knowledge of an issue,” says Charlie 
Cray, a research specialist at Greenpeace in 
Washington DC, who worked with Thorn- 
ton. His reports “put a challenge out there that 
industry couldn't answer”. One campaign that 
Thornton helped to organize, against plans to 
build more than 100 hazardous-waste incin- 
erators across the United States, climaxed in 
May 1993 when Greenpeace parked a truck 
dressed up as an incinerator outside the White 
House and some 60 people chained themselves 
to it. The next day, the Environmental Protec- 
tion Agency announced a moratorium on new 
hazardous waste incinerators. 

But Thornton was growing older, and 
yearning to “develop my own body of work”. 
His time with Greenpeace had taught him the 
power of science to influence society, and his 
ambitions turned to research. First, he had to 
deal with the small matter of graduating from 
Yale. Then living in New York, he did that by 
accruing course credits at Columbia University 
— attending his first molecular-biology classes 
aged 30 — only to find himself rejected from 
almost every graduate programme he applied 
to, in part because of his unusual CV. 

Of the seven friends and colleagues of 
Thornton’s who spoke to Nature, six called 
him intense. The seventh described him 
as “beyond intense”. But only a little of that 
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intensity is apparent at his Wednesday morning 
lab meeting in Eugene. The freezer crisis has 
blown over: the power came back after half an 
hour and the thermometer rose to only —76°C. 
Now graduate student Dave Anderson gets a 
friendly grilling during a practice talk outlining 
his thesis proposal: to trace the evolution of the 
DNA-binding domain of an ancient hormone 
receptor. The meeting stretches on for 2.5 hours 
— not uncommon in this lab, everyone says. 


A BINDING FASCINATION 

Since his Greenpeace days, Thornton has been 
fascinated by the steroid hormone receptors: 
in vertebrates, six proteins that sit in the cell 
nucleus and control the activity of genes. By 
binding specific ‘ligands’ — hormones rang- 
ing from oestrogens and androgens to corti- 
sol — the receptors trigger “these remarkable 
cascades of biological activity during develop- 
ment and physiology’, Thornton says. “Their 
affinity for their hormones is just stunning. 
A drop of hormone in a railroad tank car of 
serum is enough” — and yet, as Thornton 
learned at Greenpeace, they can be waylaid by 
toxic substances. “I wanted to know where that 
system came from,’ he says. 

When he was finally accepted for a PhD at 
Columbia, he set about comparing receptor 
genes from living organisms to piece together 
a detailed history of how the receptor family 
had evolved’. 

Just as Thornton completed his first year of 
graduate study, however, MIT Press called to 
ask if he would write a book on organochlorine 
pollution. He worked in the lab during the day 
and wrote at night, ina tiny room in his Brook- 
lyn apartment, encircled by towers of papers 
that eventually formed the nearly 1,200 ref- 
erences and 611 pages of Pandora’s Poison, 
which came out in 2000. “I was shocked 
when I saw the book,’ says Rob DeSalle, who 
studies molecular evolution at the American 
Museum of Natural History in New York, and 
co-supervised Thornton's PhD. “He could’ve 
been writing War and Peace and I wouldn't 
have known it” 

The book caused a stir. Drawing on argu- 
ments that he had formulated at Greenpeace, 
Thornton made the case that regulatory pol- 
icy should focus on managing classes of toxic 
chemicals rather than tens of thousands of 
substances, one by one — and that the priority 
should be organochlorines. These substances, 
generated by the use of chlorine gas in the 
chemical and paper-making industries, have 
properties of stability and solubility that make 
them desirable to industry but problematic to 
the environment because they are long-lived 
and accumulate in animal tissues. Nature’s 
review called Pandora’s Poison a “landmark” 
and another review compared it to Rachel Car- 
son’s famous 1962 treatise on pollutants, Silent 
Spring. The Chlorine Chemistry Council in 
Washington DC, however, decried Thornton's 
“hyperbole and faulty risk analysis”. 


But Thornton was already gearing up to 
make a different kind of splash, with his first 
paper in Science’. He and his team trampled 
the assumption that only vertebrates have ster- 
oid hormone receptors by cloning one from 
the sea slug Aplysia californica. The finding 
implied that the origin of the receptor gene was 
far more ancient than anyone had realized. “I 
would've hated to be a fellow grad student. He 
was writing a book and publishing in Science 
and having two children at the same time,” 
says Darcy Kelley, a biologist at Columbia and 
Thornton's other PhD co-supervisor. 

The approach that Thornton took in the 
2003 study is one that he has loosely followed 
ever since. Starting with the genes for ster- 
oid hormone receptors from a slew of living 
organisms, he clambered backwards through 
the evolutionary tree to deduce the most likely 
sequence of the common ancestor of all such 
receptors, which existed some 600 million to 
800 million years ago, in the common ances- 
tor of “you and a snail’, as he puts it. Instead of 
stopping there, as most evolutionary biologists 
would have done, he then built the gene and 
inserted it into cells that could manufacture 
the ancient protein. 

Resurrecting the protein, says Thornton, 
allowed his team “to experimentally test 
hypotheses about evolution that would oth- 
erwise be just speculation”. They went on to 
show’ that the ancestral receptor was sensitive 
to oestrogens but not to related hormones — 
supporting the idea that the family of receptors 
evolved through a series of gene duplications 


“HE’S HELPING TO 
PUT SOME FLESH 
ON THE BONES 
OF SPECULATION 
ABOUT HOW 
COMPLEXITY 
ARISES.” 


and that the copies gradually evolved affinities 
for other ligands (see ‘Receptors, resurrected’). 

By the time his paper came out in Science, 
Thornton had taken a faculty position in 
Eugene, an old hippy town that pays as much 
homage to bicycles as it does to cars. He built 
a house (no PVC, sustainable bamboo floors) 
and set to work building up his protein- 
resurrection lab. 

Thornton wanted to delve deeper into the 
puzzle of how complex systems with tightly 
interacting molecular parts evolve. It was a 
long-standing conundrum. As Charles Darwin 
wrote in On the Origin of Species: “If it could be 
demonstrated that any complex organ existed 
which could not possibly have been formed by 
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numerous, successive, slight modifications, 
my theory would absolutely break down.” And 
what was an evolutionary puzzle to biologists 
was a target for evolution’s critics. Michael 
Behe, a biochemist at Lehigh University in 
Bethlehem, Pennsylvania, and a senior fellow 
at the Discovery Institute in Seattle, Washing- 
ton, proposed in the 1990s that such systems 
— the blood-clotting cascade, for example, or 
the molecular motor called the flagellum — are 
so “irreducibly complex” that they could not 
have evolved step by step, and can only be the 
product of intelligent design. 

Thornton says that he didn't set out to refute 
intelligent design, but the prospect of a fight 
hardly put him off. “Been there, enjoyed that; 
he says. He chose to explore a pair of steroid 
hormone receptors: the mineralocorticoid 
receptor (MR), which binds the hormone 
aldosterone and regulates salt and water bal- 
ance; and the closely related glucocorticoid 
receptor (GR), which binds cortisol and 
controls stress response. A gene duplication 
more than 450 million years ago produced the 
two receptors — but aldosterone didn't arise 
until many millions of years later. The timing 
seemed to make the MR a textbook example 
of irreducible complexity: how could selection 
drive the evolution of a lock (the MR) to fita 
key (aldosterone) that didn’t yet exist? 


EVOLUTION AT WORK 

Led by Bridgham, Thornton's team found the 
answer by resurrecting the ancestor of both 
receptors. To their surprise, it was sensitive to 
aldosterone, suggesting that it had been acti- 
vated by an ancient ligand with a similar struc- 
ture’. Once aldosterone had evolved, the team 
proposed, evolution was able to take advan- 
tage of the existing receptor to control a new 
biological function — a process that Thorn- 
ton termed molecular exploitation. They also 
showed how its sister receptor, the GR, was 
evolving functions of its own. 

“Such studies solidly refute all parts of the 
intelligent design argument,’ wrote Christoph 
Adami, an evolutionary biologist at the Keck 
Graduate Institute of Applied Life Sciences in 
Claremont, California, in an article entitled 
‘Reducible complexity”. But Behe dismissed 
the result. The receptor and ligand are not 
irreducibly complex, he says, and evolution 
did not give them any truly new function. “I 
think his results are quite consistent with my 
own view that Darwinian processes are poor 
ones to explain the complexity found in life,” 
Behe told Nature. 

Thornton turned up more clues to the work- 
ings of evolution when his team explored the 
history of the GR, which became sensitive only 
to cortisol over the course of about 20 million 
years. Working with structural biologists at the 
University of North Carolina, Chapel Hill, the 
group determined the crystal structure of the 
common ancestor of the GR and MR. They 
showed’ that two crucial mutations together 


RECEPTORS, RESURRECTED 


The vertebrate steroid hormone receptors evolved from a common ancestor that 


existed 600 million to 800 million years ago (Mya). Resurrecting ancient proteins 
on its family tree has revealed key events in their evolution. 
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altered the binding pocket of the ancestral 
receptor so that it preferred to bind cortisol 
— and identified another five mutations that 
finished the job. 

Ina final chapter to the story, Thornton tried 
to run that evolutionary sequence backwards. 
But when the researchers reversed the seven 
mutations in the ancient cortisol-specific form, 
they could not transform it back into a protein 
that worked like the common ancestor of the 
GR and MR. They instead engineered a dud, 
unable to respond to any hormone’. That was 
because a handful of other mutations had also 
cropped up on the way to making a cortisol- 
specific receptor. They played little part in the 
receptor’s new function, but acted as an evolu- 
tionary ratchet, preventing it from regaining 
its old one. 

Thornton showed that it was necessary to 
undo those mutations too, to reverse the change. 
To him, the work was a powerful demonstration 
that the path of evolution can be contingent on 
random events. “Chance plays a very large role 
in determining what evolutionary outcomes are 
possible,” he says. The study captivated the sci- 
entific press — and beyond. “Evolution opens 
gateways into the future. But it appears to close 
them — firmly — behind it as well,” read an edi- 
torial in the New York Times. 

In the Nature article that was published this 
year” , Thornton took a break from hormone 
receptors, and instead collaborated with Tom 
Stevens, a geneticist at Eugene, to dissect the 
evolution of V-ATPase, a molecular machine 
that pumps protons across membranes to 
acidify compartments inside cells. The group 


Two crucial mutations cause 
shift away from oestrogens 


Common ancestor 


wanted to know how an essential part of the 
machine — a ring of proteins that spans cell 
membranes — evolved from an ancestral form 
with two components to one with three. 

With their protein-resurrection toolbox, 
the researchers showed that, around 800 mil- 
lion years ago, the ancestral gene coding for 
one protein component was duplicated, and 
the daughter genes then picked up two vital 
mutations. The changes meant that the proteins 
could no longer sit anywhere in the ring, but 
instead had to occupy a specific spot. Suddenly, 
the ring could function only with all three parts. 
What surprised Thornton was that the three- 
component ring seemed to work no better 
than its two-component counterpart. Random 
mutations that actually corrupted proteins had 
led to ‘irreducible complexity. 


COMPUTING COMPLEXITY 
The study flipped another finger to intelli- 
gent-design proponents — but “I’m sort of 
bored with them’, Thornton says. He is more 
excited by the next scientific story that is about 
to come out of the lab. His group wanted to 
explore how the ancestor of the entire steroid 
hormone receptor family, which was sensitive 
only to oestrogens, evolved into forms sensi- 
tive to other hormones. And this time, he found 
no clues in the crystal structures of resurrected 
proteins from before and after the change. 
The answer can be found on a computer 
screen at the end of Thornton’s lab. Mike 
Harms, a postdoc who joined the lab three 
years ago, used his expertise in biophysics 
and some immense computational power 
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to simulate the movements of every atom in 
the ancestral receptors, showing how just two 
mutations drove the transformation. When 
Harms hits play, an oestrogen molecule snug- 
gles its way into the binding pocket ofa recep- 
tor roughly 550 million years old. But when 
he runs a simulation of the same receptor with 
those two mutations, the oestrogen never finds 
a comfortable spot. 

This evolutionary story also sheds light on 
why the oestrogen receptor is now vulnerable 
to the threats against which Thornton cam- 
paigned in his former life. The team worked 
out that each steroid receptor evolved to be 
only as specific as it had to be to bind its target 
ligand and exclude all others that existed at the 
time. The oestrogen receptor achieves this by 
binding substances that contain a chemical 
structure called an aromatized A ring. Because 
oestrogens are the only steroid hormones to 
have such a ring, that criterion was enough 
to ensure that the receptor bound only oes- 
trogens for many millions of years. Until, that 
is, the chemical industry started pumping out 
hundreds of substances containing such aro- 
matized rings, which the oestrogen receptor 
unwittingly bound. “The endocrine disrupt- 
ers are taking advantage, unfortunately, of the 
promiscuity that is the result of the evolution- 
ary history of receptors,’ Thornton says. 

Thornton does see progress on the issues 
on which he once campaigned. Production of 
toxic chemicals in the United States has fallen 
since his days with Greenpeace, and in 2007 the 
European Union enacted REACH (Registra- 
tion, Evaluation, Authorisation and Restriction 
of Chemicals), which emphasizes elimination 
of the most dangerous substances. That law 
puts the onus on the chemical industry to show 
that a chemical is safe rather than on regula- 
tors to prove it is dangerous — the approach 
for which Thornton argued in Pandora’s Poison. 

Does he miss having something to campaign 
against? Yes and no. “I’m less able to convince 
myself that the world has to be exactly as I 
envision it. So it’s harder for me to occupy that 
activist persona.” Besides, he says, “My kids 
take all that energy now.” 

Or almost. His creations need tending too. 
Back in his office, we listen to the tinny voice- 
mail message left by the freezer on his phone 
earlier that day. “The past is calling,” Thornton 
says. 


Helen Pearson is Nature’ chief features 
editor. 
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Is a project to map the brain’s full 
communications network worth the money? 


building that once housed a Second 

World War torpedo factory seems an 

unlikely location for a project aiming 
to map the human brain. But the Martinos 
Center for Biomedical Imaging — an outpost 
of the Massachusetts General Hospital in an 
industrialized stretch of Boston's riverfront — 
is home to an impressive collection of magnetic 
resonance imaging machines. In January, I slid 
into the newest of these, head first. The operator 
ran a few test sequences to see whether I experi- 
enced any side effects from the unusually rapid 
changes in this machine’s magnetic field. And, 
when I didn't — no involuntary muscle twitches 
or illusory flashes of light in my peripheral 


BY JON BARDIN 


vision — we began. The machine hummed, 
then started to vibrate. For 90 minutes, I held 
still as it scanned my brain. 

That scan would be one of the first car- 
ried out by the Human Connectome Project 
(HCP), a five-year, US$40-million initiative 
funded by the National Institutes of Health 
(NIH) in Bethesda, Maryland, to map the 
brain’s long-distance communications net- 
work. The network, dubbed the ‘connectome;, 
is a web of nerve-fibre bundles that criss-cross 
the brain in their thousands and form the bulk 
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of the brain’s white matter. It relays signals 
between specialized regions devoted to func- 
tions such as sight, hearing, motion and mem- 
ory, and ties them together into a system that 
perceives, decides and acts as a unified whole. 

The connectome is bewilderingly complex 
and poorly understood. The HCP proposes 
to resolve this by using new-generation mag- 
netic resonance imaging (MRI) machines, like 
that used to scan my brain, to trace the con- 
nectomes of more than 1,000 individuals. The 
hope is that this survey will establish a baseline 
for what is normal, shed light on what the vari- 
ations might mean for qualities such as intel- 
ligence or sociability, and possibly reveal what 


MGH HUMAN CONNECTOME PROJECT ACQUISITION TEAM 


The nerve fibres of the 
author’s brain were 
traced by diffusion 
spectrum imaging, and 
coloured to represent 
their direction. 


happens if the net- 
work goes awry. “We 
increasingly believe 
that brain disorders 
— from schizophre- 
nia to depression to 
post-traumatic stress 
disorder — are disorders of connectivity,’ says 
Thomas Insel, director of the National Institute 
of Mental Health (NIMH) in Bethesda anda 
strong supporter of the HCP. “So it is of vital 
importance that we have ways of detecting and 
quantifying these connections.” 

Yet many wonder whether the NIH is mak- 
ing a mistake. Researchers have yet to prove 
that MRI techniques can produce a reliable 
picture of normal connectivity, never mind 
the types of abnormal connection likely to be 
found in brain disorders, and some researchers 
argue that the techniques have not been ade- 
quately validated. “I would do the basic neuro- 
science before I started running lots of people 
through MRI scanners,’ says David Kleinfeld, 
a physics and neurobiology researcher at the 
University of California, San Diego. 


THE GRAND CHALLENGE 

Proponents counter that the HCP isa calculated 
risk. “No one thinks this is going to produce 
a wiring diagram like you might have for 
the electricity in your house,’ says Insel. But so 
little is known about the connectome, he says, 
that even crude maps would represent a major 
scientific advance. 

The decision to take that risk was made by 
the NIH’s Blueprint for Neuroscience Research, 
set up in 2004 as a collaboration among the 15 
NIH institutes, centres and offices with an 
interest in nervous-system research. In 2009, 
after five years of funding smaller projects, the 
group asked officials from across the NIH to 
submit ideas for ‘grand challenges’ in neuro- 
science: large-scale programmes that, Insel 
says, “would be both extremely high-impact, 
and virtually impossible with traditional grant 
mechanisms’. 

The Blueprint group received a dozen sub- 
missions, including one from Michael Huerta, 
then a programme officer at the NIMH anda 
member of a Blueprint subcommittee. Huerta, 
nowat the NIH’s National Library of Medicine, 
began his research career studying the organi- 
zation of mammalian brains using old-school 
anatomical and neural-tracing techniques, 
which typically require the injection ofa tracer 
compound that migrates along nerve fibres and 
reveals their routes. So he was all too familiar 
with the barriers to such studies in humans. For 
ethical reasons, tracers can only be used post- 
mortem — when they dont migrate far enough 
to trace a fibre’s full length. “The studies just 
never panned out,’ says Huerta. 

In 2007, Huerta became fascinated by two 
new non-invasive imaging methods that might 
finally allow researchers to study the finer 
details of connectivity in the brains of living 


humans. The first was diffusion-spectrum 
imaging (DSI), developed in 2005 by Van 
Wedeen, a radiologist at the Martinos Center, 
and his colleagues’. DSI is a refinement of the 
two-decades-old diffusion tensor imaging tech- 
nique, which exploits MRI’s ability to detect the 
direction in which water molecules are mov- 
ing at each point in the brain. Because most of 
those molecules move along the lengths of nerve 
fibres, like water through a pipe, the data can be 
used to reconstruct each fibre’s location and tra- 
jectory. What DSI adds is a more sophisticated 
form of signal analysis that allows researchers to 
continue tracing fibre bundles even when one 
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slice, he showed neurons that originated in the 
left cortex, then branched out and sent fibres 
to areas on both the left and the right side of 
the brain. “The brain is not made up of point- 
to-point connections,” he said. “Its made up 
of trees.” 

This level of connectome structure is invisible 
to even the most advanced diffusion-imaging 
methods, says Mitra, who heads the Mouse 
Brain Architecture Project, a parallel version of 
the HCP, funded by the NIH and the W. M. Keck 
Foundation of Los Angeles, that seeks to gener- 
ate a whole-brain wiring diagram for the mouse 
using staining techniques. And the problem is 


“We increasingly believe that brain 
disorders are disorders of connectivity.” 


seems to pass behind another, a situation that 
posed serious problems for the older technique. 

The second method that caught Huerta’s 
attention was resting-state functional MRI (rs- 
fMRI), in which people think about nothing in 
particular while their brain activity is meas- 
ured. This is quite different from conventional 
functional-imaging studies, in which partici- 
pants are asked to carry out a specific cognitive 
task and researchers look for the brain regions 
that are activated in the process. In rs-fMRI, 
there is no task, and researchers look for cor- 
relations among the activity levels in different 
areas. The presumption is that any two regions 
with a consistently high correlation are linked 
— perhaps by an actual bundle of nerve fibres, 
but certainly by working together in some way. 

The application of both DSI and rs-fMRI had 
already led toa number of high-profile publica- 
tions. But Huerta realized that few groups were 
applying both methods in the same subjects, 
and most studies used small samples, limiting 
their generalizability. So he proposed that the 
Blueprint group fund a Human Connectome 
Project that would apply both methods to 
hundreds of people. This would allow the first 
large-scale comparison to be made between 
structural connectivity, as determined by DSI, 
and functional connectivity, as determined by 
rs-fMRI. “No single neuroimaging approach 
would give you the type of gold-standard con- 
nectivity data you need,” says Huerta, recalling 
his argument for the dual data sets. 

The Blueprint group was intrigued, but was 
not blind to the problems inherent in these tech- 
niques. One obvious issue is DSI’s spatial reso- 
lution: each fibre bundle in the image contains 
thousands of neurons, meaning that it would 
miss a great deal of structure on smaller scales. 

Partha Mitra, a neuroscientist at Cold Spring 
Harbor Laboratory in New York, illustrated the 
problem to me by displaying a series of high- 
resolution digital pictures of mouse brain 
slices, each of which had some of its neurons 
coloured with a dark brown dye. On one such 


made even worse when the data are converted 
into a ‘connectivity matrix; which seeks to quan- 
tify how much every point in the brain is con- 
nected to every other point — but can't tell the 
difference between, say, two separate fibres and 
one fibre with two branches. 

The Blueprint group was also aware of con- 
cerns about resting-state scans. As with the 
more familiar form of f{MRI, what is actually 
measured isn't neural activity itself, but blood 
flow. The general presumption is that the two 
quantities are closely related — that blood flow 
increases in a region of the brain whenever the 
neurons there are active and need to be sup- 
plied with more oxygen. But recently, Klein- 
feld points out, several studies have called that 
assumption into question, showing that some 
increases in blood flow in the brain occur with- 
out an increase in neuronal activity”. “There 
is no simple one-to-one relationship,’ he says. 


A REMAINING CONCERN 

That makes rs-fMRI studies particularly hard 
to interpret, Kleinfeld adds, if only because the 
brain’s resting-state activity may fluctuate on 
the same timescales that its blood vessels do. A 
recent review’ of rs-fMRI admits that this vas- 
cular fluctuation “remains a concern’. Other 
studies show that even something as simple as 
a subject’s pattern of breathing’ or slight move- 
ments of the head? can significantly confound 
rs-fMRI measurements. 

Even leaving the technical challenges aside, 
there was no assurance that collecting the con- 
nectomes of hundreds of individuals would 
lead to interesting generalizations. “You could 
certainly imagine situations in which every- 
one’s wiring diagrams are quite different,’ says 
Gregory Farber, the programme officer at the 
NIMH who manages the connectome project. 
Nonetheless, the Blueprint group was swayed. 
by the argument that imperfect data are bet- 
ter than no data. “The committee asked, ‘Will 
we have better methods in five years?” recalls 
Huerta. “I’m sure we would. But if we followed 
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SCANNING THE CONNECTOME 


The Human Connectome Project aims to trace the brain’s long-range communication network using two main 
techniques, both of which rely on magnetic resonance imaging (MRI) to obtain data from living people. 


Mapping structure 

Diffusion spectrum imaging detects the movement 
of water molecules that flow along nerve fibres in 
the brain. The result is a map of the brain’s 
neuronal network. 


Colours represent 
direction of neurons 


Mapping function 


Resting-state functional MRI maps resting brain 
activity, then looks for correlations between one 
area and another. Highly correlated areas are 
thought to have some kind of functional link. 


Highly 
correlated 
with node 


Weakly 
correlated 
with node 


that rule, no science would ever get done” 

The group also liked the fact that the find- 
ings would be broadly applicable to clinical, as 
well as scientific, questions. “We thought we 
could do something like what we did with the 
Human Genome Project,’ Insel says, “because 
once you have that map of the brain you can 
compare it to similar maps across develop- 
ment, or to maps of subjects with different 
disorders of brain circuitry.” 


A BLUEPRINT FOR THE BRAIN 
In July 2009, the Blueprint group announced its 
choice of the HCP as one of three grand chal- 
lenges — the other two focused on pain and 
on drugs for nervous-system disorders — and 
simultaneously put out a request for proposals. 
On 15 September 2010, the NIH announced 
that it would be funding two HCP proposals. 
The larger of the two is a 5-year, $30-million 
effort led by David Van Essen, a neurobiologist 
at Washington University in St Louis, Missouri, 
and Kamil Ugurbil, an {MRI pioneer at the Uni- 
versity of Minnesota in Minneapolis. (Another 
collaborator is Olaf Sporns, a neuroscientist 
at Indiana University in Bloomington, a co- 
author on the 2005 review article that coined 
the term ‘connectome’’.) During phase one, 
now nearing completion in Minneapolis, this 


The brain has many areas specialized 
for specific functions, some of which are 
shown here. 


Data on 
structure and 
function can be 
combined and 
analysed using 
tools such as 
network theory. 


The connectome ties these areas together, 
allowing the brain to function as a coherent 
whole. The project’s goal is to understand 
how the connectome works. 


team has developed a scanner that will be able 
to double the resolution of standard MRI. 

Once complete, that scanner will be moved 
to Washington University, where it will imme- 
diately begin high-throughput scanning. The 
plan is to use both DSI and rs-f{MRI (see ‘Scan- 
ning the connectome’) to study 1,200 people: 
300 identical twins, 300 non-identical twins 
and 600 non-twin siblings. This will allow 
researchers to explore how much of the brain’s 
connectivity is mapped out by genes. Volun- 
teers will also complete behavioural tests and 
other fMRI, magnetoencephalogram and 
electroencephalogram protocols, so that brain 
structure can be further correlated with func- 
tion. All these data will be made public, allow- 
ing unaffiliated researchers to answer their 
own questions, and Van Essen’s group plans to 
release a set of new data-analysis tools. Con- 
nectomics, Van Essen says, “has been a cottage 
industry. But we expect this project to allow for 
a much richer, more unified approach”. 

The smaller HCP project — a 3-year, 
$8.5-million effort led by Bruce Rosen, a radi- 
ologist at the Massachusetts General Hospital, 
and Arthur Toga, a neurologist at the Univer- 
sity of California, Los Angeles — involved 
building a new fMRI scanner optimized for 
the collection of fibre-tracking data. The idea 
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was to massively increase the gradient strength 
of the machine — a measure of how rapidly the 
MRIs magnetic field varies from point to point 
in the brain. A more intense gradient is like “a 
bigger mirror ina telescope’; says Wedeen, who 
is director of connectomics at the Martinos 
Center. It simultaneously makes the instru- 
ment more sensitive to faint signals, and gives it 
a higher resolution. The machine has now been 
built — it is the one that collected images of my 
brain in January — but will require much more 
tweaking and testing before it is optimized for 
routine use. But the researchers have already 
achieved a tenfold increase in sensitivity to the 
water-diffusion signal, allowing their scanner 
to trace connections much more precisely than 
the best off-the-shelf machines. 

Ina press release announcing the launch of 
the HCP in July 2009, Insel said that the project 
would “map the wiring diagram of the entire, 
living human brain” and that this map could be 
linked to “the full spectrum of brain function 
in health and disease”. Such lofty ambitions 
may or may not succeed in five years. But the 
project still has its place, says Sebastian Seung, 
a computational neuroscientist at the Massa- 
chusetts Institute of Technology in Cambridge, 
who studies brain connectivity at the cellular 
level. “I think it is a mistake to think we have 
to look at every cell in every region of the brain 
to make scientific progress,’ says Seung, who is 
not involved in the HCP. 

But he also emphasizes that the HCP’s con- 
nectivity map will be, at best, a beginning. 
“That is just going to tell us where to look,” 
he says. “Then we need to study actual cells 
to learn more’, to figure out how the brain’s 
networks actually transmit information. 

A week after my visit to the Martinos Center, I 
received my DSI data. Using free software from 
the centre, it is easy to explore the architecture 
of my brain. I can clearly see my hippocampus, 
and the vast array of fibres projecting from the 
midbrain sensory hubs up to my cerebral cortex. 
Iam overwhelmed by the visible detail and obvi- 
ous organization. At present, it is just a pretty 
picture — a novelty to show friends. But, I won- 
der: once scientists know what ‘average’ looks 
like, and once they understand the variations, 
what, if anything, will this rainbow-coloured 
highway map of my brain say about me? m 


Jon Bardin is a freelance writer based in New 
York. 
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Observatories of the mind 


An ambitious project to map the mouse brain at the Allen Institute for Brain Science is 
a huge undertaking that may unify neuroscience, argue Christof Koch and R. Clay Reid. 


euroscience is a splintered field. 
Ns 10,000 laboratories worldwide 
are pursuing distinct questions about 
the brain across a panoply of spatio-tempo- 
ral scales and in a dizzying variety of animal 
species, behaviours and developmental 
time-points. At any large neuroscience meet- 
ing, one is struck by the pace of discovery, 
with 50,000 or more practitioners heading 
away from each other in all directions, in a 
sort of scientific Big Bang. 
Although this independence is necessary, 
it has prevented neuroscience from entering 
a more mature phase, which would involve 


developing common standards and col- 
laborative projects. Neurophysiologists are 
more likely to use each other's toothbrushes 
than each other’s data and software; physi- 
ological results are hoarded and rarely made 
accessible online; molecular compounds and 
transgenic animals are shared only after pub- 
lication. All of this has made comparisons 
across laboratories difficult and has slowed 
progress. 

At the Allen Institute for Brain Science in 
Seattle, Washington, we and our colleagues 
are initiating an experiment in the sociol- 
ogy of neuroscience — a huge endeavour 
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that will involve several hundred scientists, 
engineers and technicians at the institute. 
Philanthropist Paul G. Allen, who 
founded the institute in 2003, has pledged 
US$300 million for the first four years of an 
ambitious ten-year plan that will accelerate 
progress in neuroscience, bringing his total 
commitment so far to $500 million. Our 
goal is to attract the best young scientists 
and build a series of ‘brain observatories, 
with the aim of identifying, recording and 
intervening in the mouse cerebral cortex, the 
outermost layer of the brain. Unlike the tele- 
scopes that peer at remote events in space 
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> and time, our instruments will track the 
flow of information in complex, interbraided 
neural circuits within a layer of tissue one 
millimetre thick. 

The goal is to synthesize knowledge from 
all relevant disciplines — from genomics, 
anatomy and physiology to computer mod- 
elling — into a comprehensive theory of how 
the mouse uses its cortex to see. Following 
Allen’s founding mandate for the institute, 
we will curate and publicly share all data and 
other resources (such as transgenic animals, 
probes and neuronal morphologies) for free, 
even before publication. 

We believe that this initiative signals the 
arrival of large-scale science in a field that 
is populated by small groups. It will require 
rewarding the team, rather than a few lead 
investigators, for its collective effort. We 
envisage a time in the future when sophis- 
ticated, well-equipped brain observatories 
— ‘mindscopes’ — will complement the 
many academic neuroscience labs that will 
continue to make the bulk of discoveries. 

Although the institute's project will focus 
on visual stimuli in mice, it will capture 
fundamental aspects of higher brain func- 
tions: perception, conscious awareness and 
decision-making, and how they lead to 
action. Once neuroscientists know the basic 
mechanisms in the mouse, they may start to 
understand more-complex forms of percep- 
tion in other animals, including humans. In 
short, we believe that this project has the 
potential to revolutionize our understand- 
ing of the mammalian brain. 


2020 VISION 
The brain observatories initiative focuses 
on the cerebral cortex — the proverbial ‘grey 
matter’ that is one of the hallmarks of mam- 
mals, and the seat of cognition and intel- 
ligence. For its size, the cortex is the most 
complex piece of organized matter in the 
known Universe. But given a grain of cortex, 
it is not easy to tell from which mammal it 
came. We all have the same basic hardware — 
humans just happen to have a thousand times 
more of it than mice (see ‘Man versus mouse). 
The project will start with the anatomy of 
cortical neurons, using molecular techniques 
to count and catalogue the diverse array of 
cells that transmit information into and out 
of the visual cortex, as well as those that lie 
solely within the cortex. Then we will observe 
and measure the electrical activity of a size- 
able fraction of those neurons to learn how 
the function of the cortex follows from its 
structure. It isa big question, but less daunt- 
ing if one considers the cortex as being com- 
posed of many copies 
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MAN VERSUS MOUSE 
Comparing the human and mouse brain reveals why mapping the mouse brain is easier. 

Human Mouse 
Mass of brain 1,500 grams 0.5 grams 
Number of brain neurons 86 billion 70 million 
Number of cortical neurons 16 billion 14 million 
Fraction of cortex that is visual 20% ~10% 
Cortical regions involved in vision 30 10 
Number of neurons in these areas 5 billion 1 million-2 million 
Axons in optic nerve 1 million 45,000 


imaging, which reveals brain regions 
activated in real time. However, the under- 
lying signals are slow and fuzzy, and brain 
regions are not the building blocks of the 
mind — neurons, in their immense complex- 
ity, are the atoms of perception and thought. 

Fortunately, scientists can now study indi- 
vidual mammalian neurons in living brains, 
by making neurons fluoresce using genetic 
engineering. Neurons can be made to emit 
flashes of light when they are active or, using 
the most recent optogenetics techniques, can 
be controlled with light (named Method of 
the Year in 2010 by Nature Methods’). In this 
way, researchers can transiently and revers- 
ibly control certain events in specific cell 
types at defined times. Perturbing the nerv- 
ous system lets neuroscience move from cor- 
relation to causation, from observing that a 
given circuit is activated whenever the ani- 
mal makes a decision to inferring that this 
circuit is necessary for decision-making. 

Other technological advances enable 
neuroscientists to visualize every axon, den- 
drite and synapse in a circuit using three- 
dimensional electron microscopy. Another 
observatory in the visual-brain programme 
is designed to record the electrical activity 
of thousands of neurons simultaneously. 
When all of these techniques are stitched 
together, one can imagine a full physiologi- 
cal and structural characterization of entire 
brain regions. 

Finally, we will use all of this information 
to generate realistic and dynamic computer 
models of the mouse cortex and its associ- 
ated structures, and use these to build theo- 
ries of how the visual cortex behaves. The 
modelling facility will be next door to the 
labs so that collaborators can mingle easily. 
This should shorten the time between pre- 
diction and experimental validation, a virtu- 
ous circle that will be iterated until the model 
faithfully reproduces the data. 


A CALCULATED RISK 

Even though neuroscientists have spent the 
past 120 years uncovering the constitutive 
elements of the mammalian brain, we are far 
from understanding how any higher brain 
function comes about. Yet such knowledge 
is vital, especially because of the toll that 
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nervous-system injuries and disorders take 
on individuals, families and society at large. 

Although the challenge may seem daunt- 
ing, we are not the first to propose such a 
huge undertaking. For example, the Euro- 
pean Human Brain Project, under the leader- 
ship of Henry Markram at the Swiss Federal 
Institute of Technology in Lausanne, hopes 
to bring together some 150 principal inves- 
tigators in 22 countries to create working 
computer models of the mouse, rat, monkey 
and human brains’. The brain observatories 
programme at the Allen Institute differs in 
that we are focusing on a single sense, in a 
single species, in one institution. 

We know that such an expensive project 
will have critics: the resources required 
could fund hundreds of other projects, so 
why focus them in this way? Our response 
is that funding agencies are already spending 
billions of dollars on many smaller projects 
across all areas of biomedical research, and 
the Allen Institute wants to pioneer a new 
approach. We want to understand one piece 
of brain tissue by integrating knowledge 
across techniques and scales, rather than 
distributing the funds more widely. 

There is a risk that this project will not 
work out as we anticipate, and that the vari- 
ous brain observatories — looking at anat- 
omy, physiology and modelling, for example 
— will not synergize to form a sophisticated 
understanding of the mouse visual cortex. 
There is no guarantee that neuroscience is 
ready to become big science — but the only 
way to find out is to try. m 


Christof Koch is at the Allen Institute for 
Brain Science, Seattle, Washington 98 103, 
USA, and at the California Institute of 
Technology, Pasadena, California 91125, 
USA. R. Clay Reid is at Harvard Medical 
School, Boston, Massachusetts 02115, USA, 
and will join the Allen Institute this summer. 
e-mail: christofk@alleninstitute.org 
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Think big for marine conservation 


Err on the side of caution and protect the widest- possible areas of ecologically 
important deep sea, say Phil Weaver and David Johnson. 


arine protected areas are key tools 
Me conservation, but they have 

some serious shortcomings. The 
Convention on Biological Diversity (CBD) 
has called for these areas to cover 10% of 
each of the world’s marine and coastal eco- 
regions by the end of 2020. Even this mod- 
est target is proving elusive. So far, less than 
2% of the ocean has been designated as pro- 
tected, and nearly all of these areas are in 
coastal and continental-shelf regions. This 
is partly the result of a lack of data from the 
open ocean, and partly because 
of pressures from various inter- 
est groups, which may resist 
the management of ocean areas 
with valuable resources. 

There is now an additional 
option for protecting the marine 
environment. In 2010, the CBD 
created a process to officially 
endorse Ecologically or Biologi- 
cally Significant Areas (EBSAs) 
and to convey information to 
competent intergovernmen- 
tal organizations, such as the 
United Nations General Assem- 
bly, for further action. The point 
of EBSAs is to allow scientists to 
identify areas that are particu- 
larly important to the function 
of marine ecosystems without the require- 
ment for an accompanying detailed manage- 
ment plan. This ‘softer’ procedure opens the 
door to labelling a larger swathe of the ocean 
as important to ecosystem functioning. 

An EBSA is defined by a set of criteria 
established by the CBD in 2008: the area 
should contain unique, rare or endemic 
creatures and/or habitats; have a special role 
in the survival of a given species; be impor- 
tant for the survival or recovery of threat- 
ened species; be vulnerable, fragile or slow 
to recover once harmed; have high biologi- 
cal productivity; and/or have high biological 
diversity. These are good criteria. Unfortu- 
nately, there are vast swathes of ocean about 
which very little is known, for which these 
criteria can be hard to prove. 

This lack of knowledge is not preventing 
fishing, deep-sea mining and other exploita- 
tion from expanding, however; nor should it 
stand in the way of designating EBSAs. We 
believe that EBSAs should be made as large 
as possible, encompassing all areas within 
which the criteria are likely to be substan- 
tively met, even before that can be proven. 


This approach should be widely adopted by 
the scientific community now, while pro- 
posed EBSAs are being drawn up for a first 
technical evaluation at a CBD meeting in 
Montreal, Canada, at the end of April, before 
their political endorsement. 


ATLANTIC BEGINNINGS 

The first CBD regional workshop to identify 
EBSAs was held in September 2011 to con- 
sider the northeast Atlantic, which already 
hosts a number of marine protected areas 


Species such as North Atlantic anemones deserve conservation attention. 


and precautionary bottom-fisheries closures. 
This ocean includes the Hatton and Rockall 
banks and basin area, to the west of Ireland 
and Scotland. This area, parts of which are 
heavily fished, hosts many discrete habitats 
and supports a wide range of animals, includ- 
ing fragile cold-water coral reefs and sponges. 
It provides feeding grounds for birds such as 
shearwaters and petrels. And it may har- 
bour turtles and the endangered blue whale 
(Balaenoptera musculus) as well as the criti- 
cally endangered North Atlantic right whale 
(Eubalaena glacialis). The exact boundaries 
of any of the habitats and species distributions 
are yet to be established. 

Initially, there was considerable support 
among the 25 participating scientists for the 
designation of small, discrete EBSAs, such as 
isolated seamounts, in the northeast Atlan- 
tic. Experts were focused on their particular 
specialist habitat or species group, and were 
conscious that their expert judgement would 
be scrutinized by their peers. Under such 
circumstances, it can be difficult to support a 
large EBSA that extends beyond one’ exper- 
tise and into areas where there are little data. 
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Fortunately, the eventual consensus of this 
meeting was to propose large EBSAs: eight 
extensive areas (averaging 362,097 square 
kilometres each) and two smaller inter- 
national bird areas. For Hatton—Rockall, 
that meant drawing a line around the entire 
banks region, measuring 264,322 km?. 
This stands in stark contrast to the aver- 
age size of the 276 protected areas in this 
ocean's national waters, which average just 
1,040 km? each, and the 6 high-seas pro- 
tected areas, at 47,718 km’ each. 

What should happen once an 
EBSA is defined? We suggest that 
a marine spatial plan be drawn 
up for each EBSA and regularly 
updated. This will articulate 
a vision, show what activity is 
taking place in the region (from 
commercial fishing to tourism) 
and study the impacts of those 
activities. In terms of manage- 
ment, we support a three-tier 
approach whereby areas that 
historically have been heavily 
fished and are now degraded 
remain unprotected; areas with 
light historical fishing are given 
full protection; and moderately 
fished areas are subject to fur- 
ther scrutiny. Marine protected 
areas could sit comfortably within EBSAs, 
giving protection to the most critical ecosys- 
tems. The main benefit of this system is that 
it appeals to many different stakeholders: for 
example, it legitimizes existing fishing activi- 
ties while preventing them from spreading to 
vulnerable ecosystems in future. 

As scientists meet to rationalize what 
exactly EBSAs should look like in different 
parts of the world, we urge them to err on the 
side of caution and to make the areas large 
rather than small. This will open the door 
to broader conservation measures and to 
sustainable development in future. m 


Phil Weaver is science coordinator of the 
Global Ocean Biodiversity Initiative and 

is at the National Oceanography Centre, 
Southampton, UK. David Johnson is 
executive secretary for the Convention for the 
Protection of the Marine Environment of the 
North-East Atlantic (OSPAR) in London. 
e-mail: ppew@noc.ac.uk 


Further reading accompanies this article online at 
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Acentury of cosmic rays 


High-energy nuclei coming from far beyond the Solar System, and the exotic 
particles they produce, remain our best window onto the extreme Universe. 
Michael Friedlander reflects on what we have learned. 


CC ( ; oming out of space and incident on 
the high atmosphere, there is a thin 
rain of charged particles known as 

the primary cosmic radiation.” With these 

words on the nature of cosmic rays, British 
physicist Cecil Powell began his Nobel prize 

lecture in 1950. 

Powell’s prize was awarded for his devel- 
opment of the photographic method of 
identifying high-speed and short-lived 
particles that were turning up unexpect- 
edly in cosmic-ray studies as the products 
of high-energy collisions. At the same time, 
that photographic method was being used to 
discover new components of Powell’s ‘thin 
rain’: heavy atomic nuclei. These two strands 
— the study of primary cosmic rays and the 
products of their collisions — continue to 
be woven into the fabric of today’s research. 

Although particle collisions are now stud- 
ied mainly through the use of giant parti- 
cle accelerators, the only window into the 
behaviour of the very highest-energy parti- 
cles comes from examining cosmic rays. The 
study of the primary cosmic radiation is a 
part of current astrophysics: by comparing 
the composition of cosmic rays with that of 
stars, we can identify their sources and use 
them to investigate violent stellar processes. 

This year, we celebrate the centenary of 
the discovery of cosmic rays by Austrian- 
American physicist Victor Hess. Over the 
decades, cosmic-ray research has spread in 
directions that he could never have imag- 
ined, from the discovery of antimatter to 
the use of carbon dating in archaeology. It 
has even played a crucial part in the origins 
of ‘big science’ 


RADIATION SOURCE 

Hess's research was carried out in the heady 
days following the discovery of radioactivity 
and the electron. In the early 1900s, a prime 
research tool in the study of radioactivity 
was the electroscope, a sensitive device for 
measuring the ionization produced by radi- 
ation. It was soon found that the radioactive 
components of some rocks produced ioni- 
zation, and most researchers believed that 
Earth’s crust was the source of background 
levels of radiation. To investigate, scientists 
lowered electroscopes into lakes and oceans, 
carried them up mountains and took them 
to even greater heights in open baskets 
underneath hydrogen-filled balloons. The 
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results were conflicting, with some showing 
a decrease in ionization with altitude, 
others an increase. It was during this 
confusing time that Hess, in 1911, started 
his own series of balloon flights. 


> 
Physicist Victor Hess on a balloon flight in 1912. 


Hess found that the ionization rate at first 
decreased with altitude, but then started to 
increase up to a height of 5.3 kilometres, the 
greatest height he reached. That flight took 
place from northern Bohemia (now part of 
the Czech Republic) on 17 April 1912, when 
a partial solar eclipse was visible from many 
parts of Europe. Hess detected no decrease 
in ionization during the eclipse, indicating 
that whatever the main source of the ion- 
izing radiation coming from above, it was 
not the Sun. 

Born in Austria in 1883 and educated at 
the University of Graz, Hess was a young 
assistant at the Radium Institute of the Aus- 
trian Academy of Sciences at the time of the 
flights. His discovery brought him a series 
of increasingly senior positions and grow- 
ing professional recognition, culminating 
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in a shared Nobel prize in 1936. With the 
deteriorating political situation in Europe, 
Hess was dismissed from his post at the 
University of Innsbruck in 1938 because he 
had a Jewish wife. He managed to escape 
from Austria, taking up a faculty post at 
Fordham University in New York. 

For some years after these legendary begin- 
nings, the nature of cosmic rays was strenu- 
ously debated by physicists. Robert Millikan 
(who coined the term ‘cosmic rays’ in 1925) 
continued to insist that they were electromag- 
netic ‘rays’, even after Arthur Compton had 
established that they were really ‘particles; 
as revealed by the way in which they were 
deflected by Earth’s magnetic field. 

The experimental study of cosmic rays has 
often moved ahead of theory, yielding a host 
of unpredicted discoveries. One of the most 
dramatic was the observation of particles of 
antimatter. Paul Dirac’s relativistic quantum 
theory had foretold the existence of antipar- 
ticles, and Dirac speculated that anti-atoms 
with anti-electrons might exist in distant 
anti-stars. But he made no predictions as to 
where to look for them on Earth: certainly 
not among cosmic rays. 

In 1911, a tool for the study of cosmic rays 
had been developed by Scottish physicist 
C.T.R. Wilson. Wilson realized that water 
droplets were formed in the atmosphere by 
condensation of vapour on ions (an obser- 
vation inspired by watching mist form on 
the summit of Ben Nevis in Scotland). He 
converted this insight into a powerful lab- 
oratory-scale device — the cloud chamber 
— in which the passage of charged particles 
was made visible by their trails of liquid 
droplets. In 1932, Carl Anderson was using 
a Wilson cloud chamber with a large mag- 
net to study cosmic rays when he observed 
a particle that had the mass of an electron, 
but a positive charge. The discovery of the 
positron, as Anderson named the particle, 
was recognized when Anderson shared the 
1936 Nobel prize with Hess. 

During the period 1947-56, when cosmic- 
ray studies resumed after the Second World 
War, a host of unpredicted subatomic par- 
ticles including hyperons, pions and kaons 
was found using photographic emulsions and 
Wilson cloud chambers. This complex mix 
of particles, called the particle zoo, forced a 
complete upheaval in particle theory. 

Meanwhile, the study of the primary 


SPL 


A.SAFTOIU/ASPERA 


cosmic radiation itself was advancing, and 
nuclei much heavier than helium were 
discovered. With further recent improve- 
ments to experimental techniques, the 
relative proportions of different cosmic-ray 
nuclei, and even some of their isotopes, have 
been precisely determined. 

The proportions of the different chemi- 
cal elements among cosmic rays can be 
compared with their abundances in the 
Solar System, in the atmospheres of distant 
stars and among the remnants of supernova 
explosions, to identify objects and regions 
where cosmic rays originate. Researchers 
also seek to identify the regions in which 
cosmic rays are accelerated to enormous 
energies, producing particles that travel 
close to the speed of light. The largest cor- 
responding kinetic energy measured for a 
single cosmic-ray particle is comparable to 
that of a cricket ball or baseball travelling 
at 160 kilometres per hour. This energy is 
more than 100 million times larger than that 
of protons accelerated in the Large Hadron 
Collider at CERN, Europe’ particle-physics 
laboratory near Geneva, Switzerland. 

These ultra-high-energy particles are rare 
— only a few arrive each century over each 
square kilometre of Earth. Through their 
collisions in the atmosphere they generate 
billions of particles, requiring many detec- 
tors spread out over large areas. High-energy 
y-rays, also produced in cosmic-ray sources, 
can similarly be detected by large-area arrays. 
Their arrival directions can point back to 
their sources, such as supernova remnants 
and active galaxies. Continued exploration 
of these highest-energy particles and photons 
might tell us about conditions in the early 
and very hot stages of our Universe. 

Cosmic-ray studies have expanded in 
unanticipated directions. For example, cos- 
mic rays have been identified as the source of 


the radioactive isotope carbon-14, produced 
by collisions with atmospheric nitrogen. The 
amount of carbon-14 produced in the atmos- 
phere depends on the numbers of cosmic rays 
reaching Earth, which in turn depends on the 
11-year cycle of solar activity. Measurement 
of carbon-14 has revolutionized archaeology 
by enabling the ages of ancient organic matter 
to be determined. 


COSMIC CONSEQUENCES 

Hess's discovery came from observing the 
effects of ionization produced by cosmic 
rays. That same effect is taking place in our 
bodies as cosmic rays pass through them. 
Over our lifetimes, we accumulate a radia- 
tion dose that causes biological damage, 
presumably contributing to a basic level 
of cancer production. Unshielded by the 
atmosphere, astronauts accumulate radia- 
tion doses from cosmic rays that may well 
exceed those considered safe. This could 
limit the distances to which astronauts can 
go as they explore the Solar System. 

Today, the scale of physics research has 
expanded to the point at which it is not 
unusual for a single scientific paper to have 
hundreds of authors, crossing interna- 
tional boundaries and using internation- 
ally funded equipment. The origin of this 
revolution can be traced to cosmic rays. The 
cost and manpower demands of cosmic-ray 
research in the 1950s, although modest by 
today’s standards, were beyond the capacity 
of any single group. The ‘G-stack’ collabora- 
tion, for example, of which I was a part (in 
Powell’s research group at the University 
of Bristol, UK), was created to undertake 
the flight of a ‘giant’ stack of photographic 
detectors beneath a balloon. The special 
photographic emulsions, made by the 
photo company Ilford, were processed in 
Bristol; the balloon was flown in northern 


Cosmic-ray collisions create showers of particles, requiring detectors spread out over a vast area. 
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Italy; and the measurements and analysis 
were carried out by groups in Bristol, Brus- 
sels, Copenhagen, Dublin, Genoa, Milan 
and Padua. Our results, including many 
examples of new and very short-lived 
particles, were reported in a 1955 paper 
(J. H. Davies et al. II Nuovo Ciménto 2, 
1063-1103; 1955) that carried the names 
of 36 scientists, by far the largest number 
of co-authors up to that time. 

This style of large international collabora- 
tions is today exemplified by CERN, which 
was founded in 1954 and houses the largest 
particle accelerator ever built. Prominent 
among CERN’s founders were many cosmic- 
ray scientists, including Powell and Edoardo 
Amaldi, its first director of research. 

One lesson to be 


“After learned from cosmic- 
100y ears, ray research is the need 
cosmic-ray to examine carefully 
research is any rare but apparently 
mature but strange observations, 
still open to and not to discard 
producing them as part of the 


surprises.” background noise that 
many particle-physics 
experiments accumulate. The discovery of 
antimatter rested on the detection ofa single 
track of a lone positron. Similarly, the dis- 
covery of some kaons was based on obser- 
vations of single events. Although some 
discoveries may emerge from the statistical 
analysis of large quantities of data, we should 
remember that important discoveries can 
still be established by a single observation. 

After 100 years, cosmic-ray research is 
mature but still open to producing surprises. 
Cosmic rays continue to be studied from 
balloons, Earth-orbiting satellites and long- 
range space probes as well as ground-based 
detectors that cover enormous areas, seek- 
ing the sources of the highest-energy cosmic 
rays. Antiparticles might also be identified. 
Longer flights and larger areas are permit- 
ting the accumulation of more data on par- 
ticles and cosmic y-rays, thus increasing the 
detection of yet more of the rarest events. 

There are already suggestions that some 
of the highest-energy particles and y-rays 
come from well-known objects such as 
some supernova remnants. More data might 
locate, more firmly, the directions in which 
their sources are located and should define 
the acceleration processes. Perhaps the phys- 
ical conditions are even more exotic than we 
can imagine at present. This field of astro- 
particle physics seems sure to produce future 
Nobel prizewinners. = 
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TOM COGHILL 


| COMMENT BOOKS & ARTS 


Michael Mann has been harassed personally and politically over his work on climate change. 


CLIMATE SCIENCE 


At the storm front 


Simon Lewis gets to grips with a climate scientist’s 
account of a conflict that began with a graph. 


ow would you feel if a powerful 
H politician demanded, with appar- 
ent legal authority, that you supply 
him with every piece of scientific data you 
had ever collected, every computer program 
you had written, and every detail that would 
allow his staff to replicate your work? How 
would you feel if an envelope of suspicious 
white powder was posted to you — and the 
police sealed your office? And if your family 
was threatened with violence? 
Michael Mann knows how he would feel: 
he experienced all this after publishing a 
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scientific paper. His The Hockey Stick and 
the Climate Wars tells the story of one of the 
most harassed scientists in the United States. 

The book begins slowly, moving from 
happy anecdotes of childhood computer 
programming to the joys of responsibility- 
free postdoctoral research. But it becomes 
riveting when we reach what was in retro- 
spect the turning point in Manns life: his 
1998 paper ‘Global-scale temperature pat- 
terns and climate forcing over the past six 
centuries’ (M. E. Mann et al. Nature 392, 
779-787; 1998). 
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Manns study used 
proxy climate records, 
such as tree-ring 
growth and the ratios 
of oxygen isotopes 
in corals, to make 
estimates of sur- Ti 
face air temperature ~ | } 
extending backtoaD the Hockey Stick 
1400 — long before and the Climate 
direct thermometer Wars: Dispatches 
measurements were from the Front 
available. The results Lines 
showed that in the MICUAELEMANN 

P Columbia Univ. Press: 
Northern Hemi- 5972 384 pp. $28.95, 
sphere, temperatures £79.95 
were higher in the last 
years of the twentieth century than at any 
time in the previous 600 years, and that the 
suspected dominant cause was anthropo- 
genic carbon dioxide emissions. 

Manns result came to be known as the 
hockey stick, because the key graph showed 
a flat ‘handle’ — the low-level natural oscil- 
lations in temperature in the centuries before 
humans began to create large quantities of 
greenhouse gases — and a ‘blade’ repre- 
senting the sharp uptick in surface air tem- 
perature in the twentieth century. The work 
invoked the ire of lobby groups opposed 
to political action on climate change, who 
launched an astonishing decade of attacks on 
the veracity of the result. The bulk of Mann's 
book documents this ugly debate. 

Mann ably dissects how ideologically 
driven, industry-funded campaigns attack 
climate scientists and their work to obstruct 
the formulation and implementation of 
policies restricting CO, emissions. We learn 
about Mann’s bemusement when he is first 
assailed, and later his resigned anger. After 
telling how, in 2006, a US National Academy 
of Sciences panel found no evidence that 
Mann’s hockey stick was flawed, he notes 
dryly that one “might think that this would 
have ended such charges once and for all. 
One would be wrong.” 

The influence of lobby groups on public 
perception of scientific issues is covered in 
more scholarly detail in the 2010 book Mer- 
chants of Doubt (Bloomsbury) by historians 
Naomi Oreskes and Erik Conway. However, 
Mann’s shocking first-hand testimony of the 
repeated attempts to discredit him and his 
work gives his book power. Mann reports his 
‘highs, such as when more than 30 scientific 
organizations declared their support for him 
after Senator Joe Barton (Republican, Texas) 
sent him an aggressive letter demanding, 
among other things, copies of his records. 
And we get the ‘lows’ of what Mann calls the 
“most malicious of the assaults”: the publica- 
tion in November 2009 of a cache of e-mails, 
including some from Mann, that had been 
stolen from the University of East Anglia in 
Norwich, UK. 


Although there is little in the book with 
which I seriously disagree, it left me uneasy. 
Many scientists will agree with Mann's three 
basic points: that climate change is a major 
societal problem; that there are campaigns 
to convince the public that this is not the 
case; and that scientists should engage 
with society and not allow the public to be 
“confused and misled by industry-funded 
propaganda”. 

However, I am unconvinced that present- 
ing the attacks as a “climate war” waged by 
a “corporate-funded denial machine” is the 
best way to help scientists to counter mis- 
information. It makes good copy, but the 
war metaphor, with its talk of battle scars 
and front lines, is unlikely to be an ideal 
communication strategy: it is, by defini- 
tion, polarizing. Because denial that climate 
change poses a problem for society is associ- 
ated mainly with right-wing political views, 
science communication needs to transcend 
ideological divides, not reinforce them. 

Last year saw the end of a similarly ugly 
climate-change spat, following an ideology- 
transcending innovation that Mann does not 
mention. After years of bitter dispute about 
estimates of the warming documented by 
direct thermometer measurements, the 
Berkeley Earth Surface Temperature project 
in California brought 
together vocal crit- 
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computer code. 

Unsurprisingly, the results confirmed pre- 
vious analyses. This reminded the public of 
an important result, provided a well-publi- 
cized vindication of it and, by involving key 
critics, dealt deftly with those who choose 
to confuse the public. A comparable effort 
to estimate the past millennium’s surface air 
temperature could be similarly decisive. 

An influential Editorial in Nature, cited 
by Mann, noted that climate scientists are 
ina “street fight” (Nature 464, 141; 2010) — 
imagery that I think is more apt than that of 
war. Street fighting is not about taking sides, 
and the metaphor describes well the ‘anything 
goes’ attacks that scientists can face when 
publishing or speaking about climate change. 

Mann's story is important. However, it is 
also important to step back from the conflict. 
Emphasizing a binary war might not be the 
best way to end it. = 


Simon L. Lewis is a reader in global change 
science at University College London, UK, 
and the University of Leeds, UK. 

e-mail: s..lewis@leeds.ac.uk 


Books in brief 


Opium: Reality’s Dark Dream 

Thomas Dormandy YALE UNIVERSITY PRESS 376 pp. $40 (2012) 
‘Poppy tears’ — the sap of the opium poppy — are aptly named: the 
drug has enslaved multitudes and sparked violence for 6,000 years. 
Yet opium and derivatives such as morphine and heroin have been 
stars of medicine and inspired greats such as the Romantic poet 
Samuel Taylor Coleridge and, more arguably, surgeon William 
Stewart Halsted. Those stories are threaded through medical 
historian Thomas Dormandy’s engrossing chronicle, but we’re never 
far from the shadows — the dosing of infants in Victorian England, 
the torments of addiction, and wars in Afghanistan and China. 


Megachange: The World in 2050 

Edited by Daniel Franklin and John Andrews PROFILE/WILEY 320 pp. 
£15/$34.95 (2012) 

The Economist editors Daniel Franklin and John Andrews give the 
“helicopter view” of where the world will be by mid-century. Aiming 
to pin down the trends forcing “megachange”, they offer 20 chapters 
that make predictions in everything from public health and women’s 
welfare to social networking, climate change and, inevitably, 
economics. This succinct, pithy resource has surprises: by 2050, 
China’s gross domestic product will be 80% more than the United 
States’s, for instance; and the world’s total fertility rate could stabilize. 


Circulation: William Harvey’s Revolutionary Idea 

Thomas Wright CHATTO & WINDUS 272 pp. £16.99 (2012) 

It is easy to forget that William Harvey’s great theory on the circulation 
of blood was as groundbreaking as Copernicus’s on the Solar System. 
Working mainly from dissections and vivisections, Harvey also drew 
insight from his observations of networks and systems in transport 
and technology. Author Thomas Wright’s account has a brilliant cast 
— including John Donne, the great metaphysical poet and dean of St 
Paul’s Cathedral, London, whose interest in anatomy and the human 
heart drew him to Harvey’s work. A classic example of how great 
science affects culture, language and politics. 


Learning from the Octopus: How Secrets from Nature Can Help Us 
Fight Terrorist Attacks, Natural Disasters, and Disease 

Rafe Sagarin BASIC BOOKS 320 pp. $26.99 (2012) 

How are tide pools linked to national security? Ecologist Rafe Sagarin 
says that adaptability in nature — particularly in the octopus — 
exemplifies key defence principles. For example, the cephalopod 

has both attack and defence capabilities, with a powerful bite and 
inky camouflage, and its flair for escapology in the lab shows that it 
can handle a change in environment. Drawing on life science and 
evidence from the military and emergency services, Sagarin defines 
adaptability as the “sweet spot” between reaction and prediction. 


Clouds That Look Like Things: From the Cloud Appreciation 
Society 

Gavin Pretor-Pinney SCEPTRE 112 pp. £12.99 (2012) 

Anyone who has spent long minutes staring idly into the blue will 
relish these bingo moments gathered by Gavin Pretor-Pinney, 
author of The Cloudspotter’s Guide and The Wavewatcher’s 
Companion, which won the 2011 Royal Society Winton prize for 
Science Books. From Alfred Hitchcock to flying saucers and a 
menagerie of beasts, Pretor-Pinney’s gallery of evanescence is a 
reminder of the simple joy of perception. 
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Transgenic sushi is an unusual biotech teaching tool. 


Chew on this 


Anthony King savours a surre 


ally varied show on food, 


from glowing sushi to 1,001 uses for a pig carcass. 


slaughter at any one time on Earth. 

What humans eat determines which 
crops are grown and which animals are 
reared, with big implications for science, 
agriculture, the environment and society. 

Edible, an exhibition at Dublin’s Science 
Gallery, probes our cultivation and con- 
sumption choices and their effects in the 
contexts of time, place, technology and 
taste. There are serious messages, but much 
of what is on offer has a zany flavour. 

The increasingly globalized agricultural 
system gets more than a look-in. Artist 
Christien Meindertsma’s photobook on ‘pig 
parts’ is a compellingly graphic exploration 
of the afterlife of a pig carcass. For PIG 05049 
(the number denotes the animal's ear tag), 
Meindertsma spent three years tracing the 
products derived from a single animal raised 
in the Netherlands to build a visual chronicle 
of the carcass’s incredible global journey. 

The pig's skin was used for tattoos, chew- 
ing gum, injectable collagen, safety gloves and 
cheesecake, while its bones went into paper, 
X-ray films, book covers and train brakes. 
Cigarettes, cooked ham, mink feed and 
photodynamic therapy accounted for some 
of its blood, and its gelatine eased cordite into 
bullets. Meindertsma 


F ifty billion chickens are heading to the 


dissects the pig allover NATURE.COM 

again, depicting the  Formoreon 

products to scale. attitudes to food: 
Humanity’s over- _ go.nature.com/gtmfyx 
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reliance on a handful 
of crops is explored 
in an infographic 
revealing how rice, 
wheat, maize (corn) 
and potatoes make up 
50% of what we eat. The exhibition’s cura- 
tors, Cat Kramer and Zack Denfeld — artists 
with the mobile research institute the Center 
for Genomic Gastronomy — point out that 
new crops could gain dominance within a 
century: “The future of food is much more 
complex than we can predict, they write. 

One aspect of that complexity is the need 
to maintain species diversity as insurance 
in case of disease; the Food and Agriculture 
Organization of the United Nations in Rome 
estimates that crop genetic diversity has 
fallen by 75% since 1900. The Edible Herit- 
age Lab features 12 historically and scientifi- 
cally important varieties of potato. Part of a 
haul collected by the Irish Seed Savers Asso- 
ciation, which champions non-commercial 
Irish heritage varietals, they include the 
lumper potato, which succumbed to succes- 
sive attacks by the fungal blight Phytophthora 
infestans in the mid-nineteenth century, trig- 
gering the great Irish potato famine. 

Close by, a stuffed grey squirrel illus- 
trates the question of whether we should 
eat invasive species. A recipe for Guinness 
and grey squirrel stew is provided for the 
brave. Insects give more food for thought; 
eaten over much of the world, they are seen 


EDIBLE: The Taste 
of Things to Come 

Science Gallery, Trinity 
College Dublin. 

Until 5 April 2012. 
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by many as a part of future food security. 
Insects Au Gratin by designer Susana Soares 
and her colleagues consists of three-dimen- 
sional sculptural shapes printed using milled 
insect flour. Crickets, we're told, are more 

efficient than cattle in converting veg- 

etation to protein: 100 kilograms of 

feed produces 40 kg of the insects, but 
only 10 kg of beef. 

Given the current fashion for foodyism 
and molecular gastronomy, the science 
of taste inevitably plays its part. You can 

learn whether you are a supertaster 

by gauging the bitterness of a disc 
infused with bitter compounds, or 
pinpoint where particular taste buds 
are concentrated by sprinkling droplets 
of five solutions onto your tongue. 
But with taste, the proof of the pudding 
is obviously in the eating. A ticketed ‘curated 
dinner’ — a molecular-gastronomy evoca- 
tion of the spirit of Dublin — by chef-for- 
the-night Pete Williams proved a sensory 
treat. The recipes, by local company Design- 
goat, featured a contemporary take on the 
traditional Irish coddle, involving a poached 
pork-mousse sausage with a mashed-potato 
gel, set carrot foam and a bacon and pork 
sausage broth in a test tube. 
The diners themselves learned to whip up 
a gin and tonic with a difference. Spheres of 
cucumber, lime juice and salt were mixed with 
sodium alginate and sodium citrate, then set 
in a bath of calcium chloride. The resulting 
chewable spheres went into the drinks. 
Backat the exhibition, a “mutagenic mist” 
reminds the audience that radiation has 
been used since the mid-twentieth century 
to induce mutations in crops. The mist, a 
palate-cleansing puff of peppermint oil, is 
made from a varietal approved in the 1970s 
and bred for disease resistance. Most pepper- 
mint oil consumed today is from this variety. 
Elsewhere, you can watch The Glowing 
Sushi Cooking Show, a video of Kramer and 
Denfeld creating an 


“The future of even more startling 
fo odis much treat: transgenic, bio- 
wire complies luminescent zebrafish 
p sushi. It is an intrigu- 

than we can 


ing way to absorb 
cutting-edge biotech- 
nology. “This will be 
fun at parties,” quips Denfeld. 

There is much more to this feast, from smog 
tasting to the fun inflated-stomach sculp- 
ture Gas Bag by artists Andy Best and Merja 
Puustinen. Edible offers a menu of appetizing 
morsels on the evolution of gastronomy and 
its influence on our ecological, technological, 
commercial and political world — some- 
thing to chew on even if youre not yet ready 
to snack on a cricket. m 


predict.” 


Anthony King is a writer based in Dublin. 
e-mail: anthonyjking@gmail.com 


J. SHERWOOD 


JIM HARRISON 


Barry Mazur believes that maths and literature are driven by the same imaginative impulse. 


Q&A Barry Mazur 


The maths raconteur 


Barry Mazur, a mathematician at Harvard University in Cambridge, Massachusetts, has 
explored the literary side of mathematics. With the publication this month of Circles Disturbed, 
a collection of essays on mathematics and narrative that he edited with writer Apostolos 
Doxiadis, he talks about the overlapping realms of mathematics and the imagination. 


How did you get into mathematics? 
Through electronics. At school I read popu- 
lar works on electromagnetism. I found 
it baffling that there could be action at a 
distance that let energy travel through a 
medium you couldn't touch. As a student at 
the Massachusetts Institute of Technology in 
Cambridge, I saw mathematics as a gateway 
to a more philosophical understanding of 
electronics. I wanted to understand physical 
principles, such as conservation of energy, 
from a Euclidean standpoint: rarified and 
axiomatic. I later saw that this was impos- 
sible, but I became riveted by mathemat- 
ics. | came to Harvard in 1959, starting in 
topology then moving to algebraic geometry, 
which has led me to number theory. 


What inspired you to write your 2003 book, 
Imagining Numbers? 

It began as a letter to a friend. We had been 
chatting about the difficulties of imagining 
certain things and he said that he would like to 
know what imaginary numbers were. It took 
more than 200 years for mathematicians to 


become comfortable 
with them, so I thought 
it might be instructive 
not to directly teach 
this to my friend, but 
to help him to arrive 
at his own familiar- 
ity with them. I wrote 
him a letter inviting 
him to do just that. It 
expanded, circulated 


Circles Disturbed: 
The Interplay of 
Mathematics and 


Narrative among literary friends, 

EDITED BY and eventually became 

APOQS a YIADIC 

AF OSTOLOS DOXIADIS the book. 

AND BARRY MAZUR 

Princeton University : . 

Press: 2012. 552 pp. And Circles Disturbed? 

$49.50, £34.95 The concept came 
from Apostolos Doxi- 


adis. His 1992 novel Uncle Petros and Gold- 
bach’s Conjecture marries literature and 
mathematical themes, and his 2008 graphic 
novel Logicomix is about the mathematicians 
Gottlob Frege, David Hilbert, Henri Poin- 
caré, Ludwig Wittgenstein and Kurt Godel, 
and the early-twentieth-century crisis in 
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the foundations of mathematics. Apostolos 
invited mathematicians, historians and lit- 
erary scholars to write about mathematics 
and narrative. They met in Greece in 2007 
to discuss their work; this was the origin 
of Circles Disturbed. They interviewed one 
another to make their contributions more 
accessible. These interviews are available at 
www.thalesandfriends.org. 


How can historians tell the story of maths? 
You can recount mathematical lives, as 
Peter Galison does in the book. His essay 
contrasts the biographies of John Archibald 
Wheeler — whose mechanical illustrations 
of mathematical operations were influenced 
by his upbringing on a Vermont farm — and 
Nicolas Bourbaki, the pseudonym ofa group 
of Parisian mathematicians whose vision of 
mathematical progress was monolithic and 
structural. Or you can take the meta-histor- 
ical approach, as Amir Alexander does in his 
essay, which discusses how the transforma- 
tion in mathematics in the eighteenth and 
nineteenth centuries was partly guided by 
the stories told about it. You can do a close 
reading of dramatic mathematical texts, as 
in Federica La Nave’s examination of a work 
on imaginary numbers by sixteenth-century 
Italian mathematician Rafael Bombelli, who 
laboured for decades over the question of 
whether these numbers are fictitious. 


And is it possible to use mathematics for 
literary purposes? 

Yes, you can reverse roles. A mathematical 
sensibility is useful for illuminating patterns 
and structures — such as the structure of 
narrative, as in our book. A seminal example 
is French literary theorist Gérard Genette’s 
1972 modelling of the tangled layers of time 
in the writings of Marcel Proust. 


Is there a place for dreams in mathematics? 
In Michael Harris's chapter on intuition in 
Circles Disturbed, he recounts how mathe- 
matician Robert Thomason had a dream in 
which a dead friend (not a mathematician) 
made a mathematical claim. Although the 
claim was false, it inspired Thomason, once 
awake, to successfully complete his math- 
ematical project, and to include his friend 
as a co-author in the published paper. 


What do the pursuits of maths and literature 
have in common? 

The common thread is urgency. Mathema- 
ticians are impelled by an urgent desire to 
understand. In literature, there is a simi- 
lar impulse to engage in an all-absorbing 
imaginative experience. When it comes to 
mathematical inspiration, the imagination 
could be considered the ghost in the rational 
machine of proof-making. = 


INTERVIEW BY JASCHA HOFFMAN 
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Mixed media policies 
of US federal agencies 


You cite two US government 
agencies that interact openly with 
the media (Nature 483, 6; 

2012): the National Oceanic and 
Atmospheric Administration 
(NOAA) and the National 
Science Foundation (NSF). Yet 
these are rare exceptions in my 
experience as editor of FDA 
Webview and FDA Review. 

Take the US Department of 
Health and Human Services 
(DHHS), which last September 
proclaimed a repressive media 
guidelines policy. All journalists 
are now referred to its press 
office, which permits access only 
to selected personnel and may 
monitor and record interviews 
(see go.nature.com/xczin6). 

Another large US government 
entity, the Department of 
Justice, has gone even further. 
Barely a month after the DHHS 
move, it proposed a rule that 
directs federal law-enforcement 
agencies to respond to requests 
for particularly sensitive 
documents under the Freedom 
of Information Act “as ifthe 
excluded records did not exist” 
(see go.nature.com/gfd8pn). 

Recent guidelines on openness 
from the NSF and NOAA 
are modest steps in the right 
direction — albeit on a tiny 
scale, given the size, scope and 
complexity of the US federal 
government. 

James G. Dickinson Camp Hill, 
Pennsylvania, USA. 
jim@fdaweb.com 


China: a cultural 
shift for science 


I disagree that the ancient 
cultural influences of 
Confucius and Zhuang Zhou 
are to blame for the problems 
pervading China's academic 
research today (Nature 481, 
411; 2012). Neither sage 
encouraged isolation or 
inhibited intellectual curiosity. 
I believe that China’s problems 
are exacerbated by insufficient 


respect for these cultural 
traditions, which promote such 
values as objectivity and high 
moral standards. 

China’s scientists should be 
devoting themselves to the 
advance of research that abides 
by international scientific 
standards. Misconduct and 
other flaws undermine the 
Chinese government's increased 
investment in higher education 
and scientific research. 

It is time for the research 
community in China to 
rekindle its respect for 
cultural traditions and to take 
responsibility for creating a 
healthy, transparent academic 
system. 

Keqian Xu Nanjing Normal 
University, Jiangsu, China. 
xukeqian@njnu.edu.cn 


China: philosophers 
sparked good science 


Peng Gong misrepresents 

the thoughts of Chinese 
philosophers Confucius and 
Zhuang Zhou by suggesting 
that they hinder scientific 
advancement in modern China 
(Nature 481, 411; 2012). 

Confucius encouraged 
curiosity and practice in 
teaching and learning. His 
thoughts are universal and 
timeless, and have influenced 
many other Asian countries — 
including Japan, where sound 
science thrives. 

Zhuang advocated harmony 
so that we could fulfil our 
essential connection with 
nature. This view is pertinent 
in today’s China, where the 
environment and human health 
are being damaged by explosive 
and unbalanced development. 

Moreover, Confucius, Zhuang 
and other ancient Chinese 
philosophers made significant 
advances in science, technology, 
medicine, mathematics, 
astronomy and architecture, 
with inventions such as paper 
and the compass having a large 
impact on civilization. 

Jianxin Su Fujian University of 


Technology, Fuzhou, China. 
jianxinsu@gmail.com 
Cheng Lu Newark, Delaware, USA. 


Work together to 
crack wildlife trade 


There is no single solution to 
monitoring illegal wildlife trade 
(L. FE. Toledo et al. Nature 483, 
36; 2012), which is widespread, 
adaptive and sophisticated. 
Extensive reforms are needed to 
expand government monitoring 
and reporting, and to include 

a wider range of detection 
strategies and non-government 
stakeholders (J. Phelps et al. 
Science 330, 1752-1753; 2010). 

Government agencies are 
largely limited to regulating 
wildlife trade through 
monitoring and enforcement, 
mainly confined to checkpoints 
such as airports. More 
resourceful research strategies 
are crucial for keeping up with 
wildlife trading (see, for example, 
M. Gastafiaga et al. Bird Conserv. 
Int. 21, 76-85; 2011). 

Improving regulation 
will require broadening 
collaborations, for example 
through resource monitoring 
by local communities, engaging 
the private sector in point- 
of-harvest reporting, and 
forming partnerships with 
non-governmental organizations 
and academics. Local networks 
and expertise would encourage 
pooling of resources and give 
governments access to new 
techniques and methodological 
support. 

Such cooperation would help to 
resolve taxonomic uncertainties 
and offer independent review 
of contentious trade decisions. 
This is especially relevant to 
countries where the technical 
and administrative authorities of 
the Convention on International 
Trade in Endangered Species 
lie within the same government 
institution. 

Jacob Phelps, David P. Bickford, 
Edward L. Webb National 
University of Singapore, Singapore. 
jacob.phelps@gmail.com 


Colorado methane 
study not clear-cut 


As a partner of the US National 
Oceanic and Atmospheric 
Administration, we at Western 
Energy Alliance provided much 
of the data for the reported study 
on methane emissions from a 
natural-gas field in Colorados 
Denver-Julesburg Basin (Nature 
482, 139-140; 2012). However, 
uncertainties remain over the 
extent of methane leakage from 
these operations. 

For example, this study 
(G. Pétron et al. J. Geophys. Res., 
in the press) relied heavily on 
data from mobile-laboratory 
samples taken in summer 2008, 
before regulations reduced 
emissions of volatile organic 
compounds to 90%, and before 
low-bleed pneumatic devices 
were implemented to reduce 
methane emissions from the 
field. Much of the data were 
collected in an operating 
environment that no longer 
exists. Also, the venting you 
describe is at odds with industry 
practices. 

The authors acknowledge 
that methane emissions from 
agriculture are of “comparable 
magnitude to the estimate from 
natural gas systems” in Colorado, 
but do not consider these. 

Pétron et al. should have 
noted that methane readings 
were lower at the Boulder 
observation tower than at a 
tower in a Wisconsin forest that 
is far removed from significant 
industrial activity. 

Kathleen M. Sgamma Western 
Energy Alliance, Denver, 
Colorado, USA. 

ksgamma@ 
westernenergyalliance.org 


CORRECTION 

A declaration of competing 
financial interests was omitted 
from the Correspondence 
‘Sugar: fruit fructose is still 
healthy’ (Nature 482, 470; 
2012). This has been added at 
http://go.nature.com/xudf1i. 
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OBITUARY 


Renato Dulbecco 


(1914-2012) 


Molecular biologist who proved that virus-derived genes can trigger cancer. 


enato Dulbecco, a giant of cancer 
Rice: passed away peacefully on 

19 February, just three days shy of his 
98th birthday. Through a decade-long series 
of experiments that began in the late 1950s 
— first at the California Institute of Tech- 
nology (Caltech) in Pasadena and then at 
the Salk Institute in La Jolla, California — he 
and his co-workers demon- 
strated that the behaviour of 
cancer cells could be traced 
directly to acquired genes. 
Some viruses can integrate 
their genomes into host cell 
DNA, inducing uncontrolled 
cell growth — the hallmark 
of a cancer cell. For these 
early clues to the genetic ; 
origins of cancer, Dulbecco 5% | 
shared the 1975 Nobel prize | 4 
with the late Howard Temin, 
his former student, and with 
David Baltimore, my former 
mentor. 

Born in Catanzaro, Italy, on 
22 February 1914, Dulbecco 
grew up during the First 
World War and its aftermath 
with his mother and siblings. 
He entered the University of 
Turin in 1930 at the age of 16 
to study medicine. His inter- 
ests quickly shifted to biology 
and he became a laboratory 
assistant for Giuseppe Levi, learning from 
him the cell-culture techniques that hada 
dominant role in Dulbecco’ later scientific 
work. His fellow students included two 
other future Nobel laureates — microbiolo- 
gist Salvador Luria and neurobiologist Rita 
Levi-Montalcini. 

After obtaining his doctorate in 1936, 
Dulbecco was drafted into the Italian army 
as a physician. He was called to action in 
1939, at the outset of the Second World War, 
to serve first in France and then in Russia. 
Disaffected by the war and its consequences, 
he deserted the army and hid in a small vil- 
lage near Turin, becoming physician to the 
local partisan units resisting the German 
occupation. After the end of the war, follow- 
ing a short stint as an elected city councillor 
in Turin in 1945, he returned to scientific 
research at the University of Turin. 

In 1946, Luria invited Dulbecco to join his 
research group at the University of Indiana 
in Bloomington to work on bacteriophages 
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— the viruses of bacteria. Dulbecco shared 
his laboratory with a graduate student, 
James Watson, who was later to determine 
the structure of DNA with Francis Crick. 
Dulbecco’s finding that visible light can 
reactivate phages previously rendered inac- 
tive by exposure to ultraviolet light — with 
repercussions for experimental methods 


= 


at the time — brought him to wider atten- 
tion. Phage geneticist Max Delbriick invited 
Dulbecco to join him at Caltech in 1949 as a 
research fellow. 

Dulbecco felt that research into animal 
viruses would benefit hugely from improved 
quantitative techniques. In 1952, he devel- 
oped the plaque assay for Western equine 
encephalomyelitis virus: a system that 
allowed researchers to determine the num- 
ber of biologically active viral particles in a 
sample by counting the number of discrete 
plaques, or clear spots, where a single virus 
and its progeny had killed host cells. Soon 
Dulbecco was joined by Marguerite Vogt, 
a German émigré. Their fruitful collabora- 
tions led to quantitative assays for the polio- 
myelitis virus, a scourge at the time, and later 
for the polyoma virus — a DNA virus that 
causes many different types of tumours. 

In 1963, Dulbecco left Caltech to become 
one of the first fellows of the Salk Institute, 
founded by Jonas Salk of polio vaccine 
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fame. He and Vogt continued their research 
collaborations on tumour viruses, defini- 
tively proving the essential role of the viral 
genes in transforming cells into cancer cells. 
During this highly productive period, Dul- 
becco trained some of the top virologists in 
the world. In 1972, he moved to the Impe- 
rial Cancer Research Fund (now Cancer 
Research UK) in London, 
where he initiated studies 
of breast cancer — a subject 
he continued to pursue after 
moving back to California in 
1977. His last scientific paper, 
published in 2004, identified 
tumour-initiating cells in rat 
mammary cancers. In 1988, 
Dulbecco assumed the presi- 
dency of the Salk Institute, 
following the illness of long- 
term president Frederic de 
Hoffmann, and brought sta- 
bility during a difficult time. 

An experimentalist at 
heart, Dulbecco loved to 
work in the laboratory. He 
was a quiet man, yet did not 
shy away from making bold 
visionary statements: in a 
classic 1986 perspective in 
Science (R. Dulbecco Science 
231, 1055-1056; 1986), he 
argued that the best way to 
understand cancer would be 
to sequence the human cancer genome. In 
1992, Dulbecco stepped down from the Salk 
presidency to organize the Italian contribu- 
tion to the Human Genome Project. He spent 
a good part of his time in Italy, where he was 
much revered and helped the careers of many 
young scientists. 

Dulbecco was an elegant man: a dapper 
dresser, soft spoken, with an ever-present 
smile. He was comfortable among the nobil- 
ity, yet adept at changing the oil in his car; 
at once distant yet approachable. He was a 
charming host at dinners organized by his 
wife Maureen, which my wife and I often had 
the pleasure of attending. In short, he was 
always the perfect gentleman. = 
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Life imperfectly imitates life 


Some species evolve to resemble another species so as to protect themselves from predation, but this mimicry is often imprecise. 
An analysis of hoverflies suggests why imperfect imitation persists in the face of natural selection. SEE LETTER P.461 


DAVID W. PFENNIG & DAVID W. KIKUCHI 


ne hundred and fifty years ago, the 
() English naturalist Henry Walter 

Bates’ discovered a phenomenon that 
he regarded as “a most powerful proof of the 
theory of natural selection”. Charles Darwin 
and Alfred Russel Wallace, who had pro- 
posed natural selection as the engine of evolu- 
tion only a few years earlier, agreed. Indeed, 
Darwin’ considered Bates’s manuscript to be 
“one of the most remarkable and admirable 
papers I ever read”. 

Bates had uncovered a spectacular type of 
adaptation, now known as Batesian mimicry’, 
in which a species (the mimic) that is edible 
to predators evolves to resemble a conspicu- 
ous, inedible species (the model) that preda- 
tors avoid. These lookalikes are selectively 
favoured, Bates argued’, because predators 
confuse them for the inedible model and 
thereby avoid them. This idea was so persua- 
sive that Batesian mimicry is still widely used 
to illustrate the power of natural selection for 
producing adaptations**. However, mimics 
are often poor replicas of their model, and this 
inexact mimicry poses a challenge for evo- 
lutionary theory’. On page 461 of this issue, 
Penney et al.” discuss possible explanations for 
why mimics are frequently imprecise. 

As Penney and colleagues describe, several 
non-mutually exclusive hypotheses have been 
proposed to explain imperfect mimicry. They 
include: the ‘eye-of-the-beholder’ or sensory- 
limitation hypothesis, which asserts that 
imprecise mimicry is an artefact of human per- 
ception; the ‘jack-of-all-trades’ or multimodel 
hypothesis, which posits that imperfect mimics 
persist because they are under selection pres- 
sure to resemble more than one model; and the 
‘kin-selectiom hypothesis, which asserts that 
imperfect mimicry is maintained because it 
provides benefits for genetically related indi- 
viduals. Specifically, if mimics are imperfect, 
then predators will be more discriminatory 
and occasionally spare a mimic by mistaking 
it for a model. This will also spare relatives that 
share the same imperfect trait. 

Another theory to explain imperfect mimicry 
is the ‘relaxed-selection hypothesis, which 
holds that there is little selective benefit in 
refining mimetic resemblance beyond a certain 


Figure 1 | A fly in wasp’s clothing. The harmless marmalade hoverfly, Episyrphus balteatus, shown here, 
resembles stinging wasps. Although some species of hoverfly closely resemble wasps or bees, others are 
poor lookalikes. Penney and colleagues’ show that the lower selection pressures that come with decreasing 
abundance and small body size may provide the explanation for how such imprecise mimicry can persist. 


point, for example if the model is common or 
especially noxious. Finally, the ‘constraints’ 
hypothesis proposes that either imperfect mim- 
ics lack the genes to produce refined mimicry, 
or that a selective trade-off exists between pred- 
ator-mediated selection favouring more precise 
mimicry on the one hand and other agents 
of selection (such as mate choice) favouring 
divergence on the other. 

Until now, no study had rigorously evaluated 
these multiple hypotheses within a single sys- 
tem. Penney and colleagues’ did precisely this 
by comparing the degree of mimicry in dozens 
of species of hoverfly. Mimicry is a dominant 
feature of this large family of harmless Diptera 
(Fig. 1). About 5,600 species have been 
described, at least a quarter of which resem- 
ble stinging wasps and bees (Hymenoptera)’. 
Although some hoverflies are remarkably 
accurate mimics, converging both morpho- 
logically and behaviourally on their hymen- 
opteran model, many others seem to bear 
poor resemblance’. This variation in mimetic 
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fidelity makes hoverflies ideal for studying 
imperfect mimicry. 

The authors report that birds (which are 
hoverfly predators) and humans seem to 
agree on the resemblances between hoverflies 
and hymenopterans. Thus, in this case, the 
eye-of-the-beholder hypothesis does not 
hold. Furthermore, Penney and colleagues’ 
statistical analysis reveals that the mimics’ 
characteristics do not fall somewhere between 
those of different models, so the jack-of-all- 
trades hypothesis receives no support either. 
The kin-selection hypothesis predicts that 
mimetic precision should decrease with an 
increasing abundance of mimics, but the 
authors observed the opposite trend. In fact, 
this finding is consistent only with the con- 
straints and relaxed-selection hypotheses. In 
addition to predicting that mimics’ precision 
should increase with their abundance, these 
two hypotheses predict that mimetic preci- 
sion should increase with the body size of the 
mimic, and this is exactly what the authors find. 


M. COLE/FLPA 


To understand how the relaxed-selection 
hypothesis applies when both the abundance 
and body size of mimics increase, let us con- 
sider the problem of discriminating between 
tasty mimics and nasty models from the preda- 
tors’ perspective. If not all mimics are equally 
deceptive, a predator must take risks when 
attacking its prey — ifit strikes a mimic, it will 
reap more reward than regret. When mim- 
ics are abundant, the likelihood of attacking 
a model by mistake decreases. In such condi- 
tions, predators should be willing to sample all 
but the best mimics, which will push selection 
towards more precise mimicry. 

By the same token, when mimics have a 
larger body size, their potential benefit to 
predators is greater (larger prey are gener- 
ally more profitable for predators), so there 
will be greater selection pressure on them 
than on their smaller counterparts to become 
better mimics. Although other studies have 
found that mimetic precision increases 
when mimics are abundant’, Penny and col- 
leagues’ demonstration that this pattern also 
holds when mimics increase in size is an 
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Swarming 


elegant affirmation of an old idea. 

We still do not know whether hoverfly 
mimicry is imprecise because of an absence of 
selection for refinement once mimicry is ‘good 
enough’ (as in the relaxed-selection hypoth- 
esis), or whether there is active selection 
pressure against further refinement because 
of the costs of producing better mimicry (as 
in the constraints hypothesis). For example, 
constraints may be imposed by competi- 
tion between mimics and their models (over 
shared resources® or reproductive opportuni- 
ties’), which would favour divergence between 
them and, hence, imprecise mimicry”. Future 
studies are needed to tease apart these two 
hypotheses. 

As we celebrate the 150th anniversary of 
Bates’s discovery of mimicry’, the topic con- 
tinues to fascinate the public and scientists 
alike**. Penny and colleagues’ findings help us 
to understand why selection sometimes pro- 
duces precise mimicry, but often does not, and 
further clarification of this puzzle promises to 
provide additional insight into the evolutionary 
process. m 


microtubules 


An artificial system of microtubules propelled by dynein motor proteins 
self- organizes into a pattern of whirling rings. This observation may provide 
insight into collective motion in biological systems. SEE LETTER P.448 


TAMAS VICSEK 


he spectacle of animals moving 
en masse is arguably one of the most 
fascinating phenomena in biology. For 
example, schools of fish can move in an orderly 
manner, and then change direction abruptly 
or, if under pressure from a nearby preda- 
tor, swirl like a vigorously stirred fluid. The 
non-living world also has examples of collec- 
tive motion, in systems that consist of units 
ranging from macromolecules to metallic 
rods, or even robots. On page 448 of this issue, 
Sumino et al.' describe another, until now 
unobserved, example of such behaviour: the 
coordinated motion of hundreds of thousands 
of subcellular structures known as micro- 
tubules, which spontaneously self-organize 
into a lattice-like structure of vortices. When 
considered in the context of about halfa dozen 
known universal classes of collective-motion 
pattern’, this new structure poses challenges 
in terms of explaining how it can arise and its 
relevance to applications. 
If two or more moving units such as 


self-propelled rods interact, their direction 
of motion is likely to change. When pairwise 
interactions dominate over multiparty ones, 
the process of two units approaching each 
other, then adopting a new direction and leav- 
ing the area of interaction, can be interpreted 
as a collision. In closed systems at equili- 
brium, such collisions conserve energy and 
momentum, whereas, for example, when two 
birds interact and decide which way to fly in 
the open expanses of the sky, the concept of 
conservation of momentum is not applicable. 

Typically, local interactions between organ- 
isms result in a consensus: fish or mammals 
tend to adopt a common direction of motion. 
Such ‘polar interactions are widespread and 
have been observed even for bacteria’ and 
locusts". Nonetheless, it was a great achieve- 
ment when, in 2010, two groups”® observed 
motional patterns associated with polar inter- 
actions on a molecular scale — that is, for a 
huge number of actin filaments on a layer of 
immobilized myosin protein heads. 

In addition to polar interactions, ‘nematic 
interactions also occur. In this case, if the 
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directions of motion of two units approaching 
one another form an angle smaller than 90°, 
both parts will take the same direction after the 
approach. But if the units come towards one 
another from directions that differ by more 
than 90°, they will leave in opposite directions. 

Sumino et al.' have constructed a biological 
system in which nematic collisions take place 
(Fig. 1). The authors achieved this by choos- 
ing an assay of moving microtubules propelled 
by modified motor proteins (one-headed 
dynein molecules) in which the microtubules 
cannot, for the most part, cross each other’s 
trajectories and maintain their own trajectory’s 
direction and curvature. The authors find that, 
in this setting, and for relatively high densities 
(typically 5 microtubules in 100 square micro- 
metres), the moving microtubules self-organize 
into a semiregular pattern of whirling rings, 
or vortices, within which they move either 
clockwise or anticlockwise. Furthermore, as 
time goes on, the microtubules jump from one 
vortex to another and change their rotational 
direction. The size of the observed pattern is 
large compared with the 15-micrometre length 
of each microtubule: the system shows regulari- 
ties on a millimetre scale. 

One of the strengths of the present study is 
the authors’ ability to explain the main features 
of the microtubules’ intricate motion with a 
simple model. The model draws on studies 
aimed at understanding the rich, large-scale 
behaviour that results from simple bilateral 
interactions between point-like, self-propelled 
particles. 

But Sumino and colleagues’ results also 
prompt several questions. For example, is 
the observation that individual microtubules 
have a slight preference to rotate anticlockwise 
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Figure 1 | Nematic collisions. Sumino et al.' have 
studied a system of microtubules in which nematic 
collisions occur. The microtubules are propelled 
by one-headed dynein molecules attached to 

the surface of a substrate. Three microtubules 

are shown here: a, approaching one another; 

b, colliding; and ¢, after the collision has taken 
place. Their final direction of motion depends on 
the angles at which they approach one another. 

In this case, the two leftmost microtubules 

adopt the same direction, whereas the rightmost 
microtubule leaves in the opposite direction. 


a necessary condition for the observed 
semi-regular pattern of vortices? How is the 
persistence of the curvature of the micro- 
tubules’ trajectories relevant? Furthermore, it 
would be interesting to investigate why actin 
filaments undergo polar interactions, whereas 
microtubules undergo nematic ones. 
Whether the authors’ observations are 
system specific or belong to a new class of col- 
lective-motion pattern remains to be seen. But 
one possibility is that the observed lattice-like 
structure falls within the group of patterns of 
vortex formation seen in many systems made 
up of self-propelled units. Typically, vortex 


formation can be closely related to the geometry 
of the confined area within which the collec- 
tive motion occurs. In the current case, it seems 
that, owing to the persistence of curvature in the 
individual microtubules motion, curved ‘walls’ 
of microtubules are spontaneously formed and 
enforce the formation of the vortices. Indeed, it 
has been shown’ that, without such curvature 
persistence, nematic interactions in a system 
of self-propelled rods do not result in a similar 
arrangement of vortices. 

Potential applications of Sumino and col- 
leagues’ study remain, for the time being, in 
the realm of speculation. But it is conceivable 
that the complex fluid currents generated in 
the stems of various plants relate to the collec- 
tive motion of microtubules observed by the 
authors’. The streams of energy-supplying 
and waste materials in a stem have to fulfil a 
number of criteria, which could be best met 
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through a coordinated action of motor mol- 
ecules. The authors’ demonstration that such 
action can be attained in a relatively simple 
assay is a step towards a better understanding 
of the role of collective motion in plants and 
other biological systems. = 
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How to duplicate 
a DNA package 


Cells replicate half of their genome as short fragments that are put together later 
on. The way in which this process is linked to the formation of DNA-protein 
complexes called nucleosomes is now becoming clearer. SEE ARTICLE P.434 


ALYSIA VANDENBERG 
& GENEVIEVE ALMOUZNI 


of DNA is wrapped around proteins to 
form nucleosomes, the basic units of 
chromatin. When this DNA is duplicated, the 
nucleosomes that lie ahead of the replication 
machinery are displaced, or disassembled into 
their component histone proteins. And after 
the machinery has passed by, the nucleosomes 
are reassembled on the two resulting ‘daugh- 
ter’ strands, which requires the addition of new 
histones’.To make things more complicated, 
one of the two DNA strands can be replicated 
only in short pieces — called Okazaki frag- 
ments — that are joined together afterwards, 
and so its replication lags behind that of the 
other strand (Fig. 1). This elaborate mecha- 
nism is needed because DNA synthesis is 
unidirectional, whereas the sequences of the 
two DNA strands run in opposite directions. 
However, the links between DNA replication 
and nucleosome dynamics have previously 
been studied without distinguishing between 
the two strands. On page 434 of this issue, 
Smith and Whitehouse’ uncover the interplay 
between nucleosome assembly and the synthe- 
sis of the lagging strand. 
The authors’ analysed Okazaki fragments 


E a typical human cell, about two metres 
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from baker’s yeast, a unicellular fungus 
commonly used as a model for eukaryotic 
cells — those found in organisms such as ani- 
mals, plants and fungi. Smith and Whitehouse 
took advantage of an available large catalogue 
of defined yeast mutants, together with well 
annotated genomic information that included 
recently defined nucleosome positions’. The 
researchers developed an elegant method for 
purifying Okazaki fragments from yeast, then 
sequenced the fragments and mapped them on 
the yeast genome. The result is the first high- 
resolution, genome-wide analysis of Okazaki 
fragments from live eukaryotic cells. The anal- 
ysis is a major achievement, as our knowledge 
of lagging-strand synthesis has been based 
largely on model studies’ of the replication of 
non-chromosomal DNA molecules such as 
viral DNA. 

Smith and Whitehouse discovered that, 
strikingly, the Okazaki fragments’ size and 
location in the genome depend on where the 
nucleosomes are located. More specifically, 
the ends of the Okazaki fragments fall near the 
nucleosome’s dyad axis, which corresponds to 
the midpoint of the DNA segment wrapped 
around the chromatin unit. This result con- 
trasts with a common assumption that, dur- 
ing lagging-strand replication, a nucleosome 
would form as soon as the Okazaki fragment 


Pole 
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Nucleosome 
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Figure 1 | Asymmetric DNA replication. The sequences of the two strands ofa DNA molecule run in 
opposite directions, but during replication the new DNA molecules are synthesized in only one direction. 
Each strand is therefore duplicated differently. The leading strand is used as a template by a polymerase 
enzyme (Pol ¢, top), which makes a new molecule in a continuous manner. By contrast, duplication of 

the lagging strand is discontinuous and requires another polymerase (Pol 5, bottom), which synthesizes 
short DNA pieces known as Okazaki fragments. These fragments are then joined together with the help 
of other enzymes to form a continuous molecule. During the process, nucleosomes — protein complexes 
around which DNA is coiled — disassemble from parental DNA to then reassemble, along with additional 
nucleosome proteins as needed, allowing both DNA duplication and chromatin reorganization. Smith 
and Whitehouse’s analysis” of Okazaki fragments illuminates the link between nucleosome assembly and 


the synthesis of the lagging strand. 


was long enough, and then the next fragment 
would be synthesized. This assumption tallies 
with the observation that the average length 
of Okazaki fragments is similar to that of the 
DNA wound around a nucleosome. 

The authors’ propose an alternative model 
in which the presence ofa nascent nucleosome 
acts as a roadblock to an advancing polymerase 
enzyme (Pol 5), which is synthesizing the lag- 
ging strand. According to their model, Pol 6 
and other associated enzymes can invade the 
nascent nucleosome up to its dyad axis, but 
then the nucleosome mediates the dissocia- 
tion of all of these enzymes from the DNA — 
by doing so, the nucleosome determines the 
length of the Okazaki fragment. 

To test the model, Smith and Whitehouse 
used yeast strains in which the physiological 
balance between polymerases and nucleo- 
somes was altered, either because the polymer- 
ases were less active or because nucleosome 
assembly was impaired. For example, they 
observed’ that the Okazaki fragments were 
longer in mutant strains defective in CAF-1, 
a protein complex known’ to act as a histone 
chaperone, which promotes histone deposition 
on newly synthesized DNA. Therefore, the 
tight balance between the capacity to form a 
new nucleosome on the lagging strand and the 
polymerase’s ability to advance is what sets 
the length of the Okazaki fragment. These 
findings also suggest that the efficiency of 
the replication machinery’s progression may 
depend on where the nucleosome forms. 

The authors’ work provides the basis for 
research on how other factors involved in his- 
tone dynamics and nucleosome movement 
on DNA — such as histone chaperones’ and 
chromatin remodeller enzymes — can affect 
Okazaki-fragment size and, overall, DNA 


replication. Moreover, additional proteins 
involved in chromatin assembly could be 
identified by screening the available collec- 
tion of yeast mutants for changes in Okazaki- 
fragment length. And the technique used for 
mapping Okazaki fragments offers a means 
to identify the genomic sites at which DNA 
replication starts. 

It is tempting to postulate that the inher- 
ent asymmetry of nucleosome dynamics 
during DNA replication could contribute to 
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a mechanism for asymmetric cell division, a 
process by which two different cell types are 
generated. This hypothesis is supported by the 
finding® that CAF-1 has a role in asymmet- 
ric cell division in the worm Caenorhabditis 
elegans. In addition, histones in nucleosomes 
exist in different forms associated with specific 
genomic regions, and can be ‘marked’ with 
various chemical modifications that reflect 
functional states, such as activation or repres- 
sion of gene expression. Whereas some of these 
modifications are transient, others may be 
inherited during cell division in an epigenetic 
fashion — that is, independently of the DNA 
sequence. But to understand how the chemical 
modifications of nucleosomes can be inherited 
during cell division, we must first determine 
how nucleosomal organization is reproduced 
during DNA replication. So Smith and White- 
house's study paves the way for a deeper explo- 
ration of the intimate relationship between 
genetics and epigenetics. m 
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Bubbles from the deep 


Astudy suggests that hydrocarbons released from sedimentary basins formed 
part of a climatic feedback mechanism that exacerbated global warming during 


the Eocene epoch. 


HENRIK SVENSEN 


as released from sedimentary basins 

has been proposed to be a key player 

in many of the rapid climate changes 
that occurred in the past 250 million years. 
The types and sources of the released gases 
are still debated, but they probably included 
gases formed by the heating of organic matter 
around hot magma’, and gases released by the 
dissociation of gas hydrates (solid compounds 
that trap gas molecules) found in deep-ocean 
sediments’. The influence of hydrates is greater 
in a warming world because higher ocean 
temperatures can melt methane-bearing gas 
hydrates, releasing the gas. The methane can 


then be oxidized to carbon dioxide, causing 
ocean acidification, and can contribute to 
global warming because both methane and 
carbon dioxide are greenhouse gases. Writing 
in Geophysical Research Letters, Kroeger and 
Funnell’ suggest another way in which global 
warming can lead to increased gas emissions 
from the sea floor. 

Sedimentary basins have large accumula- 
tions of biological, inorganic and clastic depos- 
its (which consist of fragments of pre-existing 
rocks), and are host to more than 99.9% of the 
organic carbon in Earth's crust. This amounts 
to a staggering 15,000,000 gigatonnes (Gt) of 
carbon; for comparison, 3,300 Gt of carbon 
are stored in all known hydrocarbon and 
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coal reserves*. Any viable mechanism for 
transferring significant quantities of sediment- 
bound carbon to the atmosphere on a short 
timescale may thus perturb the global carbon 
cycle and lead to global warming. 

A series of such perturbations has been sug- 
gested”* to have happened in the Eocene epoch 
(which ran from 55.8 million to 33.9 mil- 
lion years ago) to explain several short-lived 
climatic anomalies that occurred during a 
period of otherwise steady warming from 
58 to 50 Myr ago (Fig. 1). The most prominent 
of these events was the Palaeocene-Eocene 
thermal maximum (PETM), which was char- 
acterized by global warming of 5-10 °C and 
subtropical conditions in the Arctic. One 
cause of the PETM is commonly assumed to 
have been methane release to the atmosphere 
that was triggered either by the melting of 
gas hydrates” or by the heating of rocks rich 
in organic material in sedimentary basins 
(caused by widespread volcanic activity in the 
northeast Atlantic region’”’). 

The extended period of warming during 
the Eocene culminated in the Early Eocene 
Climatic Optimum (EECO) 52 to 50 Myr ago, 
the hottest prolonged climatic episode since 
the Cretaceous period 145.5 to 65.5 Myr ago. 
Kroeger and Funnell’ focus on the EECO in 
their work. Using computer models of four 
sedimentary basins in the southwest Pacific, 
they simulated the increase in hydrocarbon 
generation in the basins as the warming 
Eocene oceans transferred heat to the sea floor 
and to deep-seated, organic-rich rocks. 

The sedimentary basins studied by the 
authors all contained rocks that could pro- 
duce petroleum given the right temperature 
conditions and sufficient time. Organic matter 
within the sedimentary rocks can convert into 
petroleum at temperatures of 60-120 °C (the 
oil window), or into predominantly natural 
gas at temperatures of 100-200 °C (the gas 
window). On the basis of their models, Kroeger 
and Funnell estimate that the warming 
ocean during and after the PETM eventually 
raised the temperature of a 300-metre-thick 
layer of sediments into the oil and gas 
windows. A marked rise in hydrocarbon pro- 
duction would therefore have occurred in 
the 4 to 5 Myr following the PETM, peaking 
during the EECO. 

The authors’ results help us to understand 
the dynamics and temperature-dependence 
of hydrocarbon (petroleum and gas) genera- 
tion in sedimentary basins. But there is more 
to the story than that. If oil and gas generated 
during the Eocene somehow escaped the sed- 
imentary basins and leaked out to the ocean 
and the atmosphere in sufficient quantities, 
this may have contributed to global warm- 
ing at the time. In other words, Kroeger and 
Funnell propose a climate feedback mecha- 
nism: global warming causes increased hydro- 
carbon production that leads to prolonged 
global warming. 
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Figure 1 | Ancient climate change. The graph depicts ocean temperatures during the Palaeocene and 
early Eocene epochs, estimated from the abundance of oxygen isotopes in microfossils”®. A warming 
trend from about 58 to 50 Myr ago was punctuated by short-lived climatic anomalies known as 
hyperthermals, of which the Palaeocene-Eocene Thermal Maximum (PETM) was the greatest. The 
warmest climate occurred during the Early Eocene Climatic Optimum (EECO) about 52 to 50 Myr ago, 
after which climate cooling occurred. The PETM may have been caused in part by the release of methane 
to the atmosphere, triggered by the melting of gas hydrates (solids that trap gases in ocean sediments) 
and/or by the heating of organic matter by intruding magma in sedimentary basins. Kroeger and 
Funnell’ propose that the production of hydrocarbons (petroleum and natural gas) in sedimentary basins 
increased during and after the PETM, and that seepage of hydrocarbons from such basins into the ocean 


and atmosphere contributed to global warming. 


The idea that the conversion of organic 
matter into petroleum following a tempera- 
ture increase in sedimentary basins may form 
part of a climate feedback mechanism was 
first posited® by Kroeger and colleagues last 
year. The modelling in Kroeger and Funnell’s 
present work’ remarkably predicts and quan- 
tifies a peak in hydrocarbon production that 
overlaps in time with the EECO. The authors’ 
results show that about 37 Gt of oil and 8 Gt 
of gas were generated from the four basins 
during this period, which is 50% more than 
would have occurred in the absence of extra 
ocean warming. This alone would not have 
been sufficient to affect the Eocene climate’, 
but many other basins around the world would 
presumably have increased in temperature at 
the same time. The mass of hydrocarbons 
generated globally could therefore have been 
considerably higher than that predicted in the 
authors’ study, although the exact amount is 
difficult to quantify. 

Is hydrocarbon seepage at the sea floor on 
a climate-changing scale a realistic possibility? 
Kroeger and Funnell argue that it is, because 
more oil and gas are generated in sedimentary 
basins than are trapped””. Studies" of sedimen- 
tary basins around the globe have shown that 
gas seepage is currently acommon phenom- 
enon, and there is no reason why this should not 
have been the case earlier in Earth’s history”. 

But did the hydrocarbons predicted’ to have 
been generated during the Eocene really leak 
out to the ocean and atmosphere, or did they 
stay trapped in the subsurface? A way to test 
this is to search for carbonate deposits in old 
sea-floor sediments, because sea-floor hydro- 
carbon seepage leaves behind deposits that 


have distinct geochemical signals that can 
be attributed to their origin’’. Unfortunately, 
the carbonate record is currently too poorly 
investigated to confirm the extent of seepage 
from sedimentary basins during the Eocene, 
and so more studies are needed. Nevertheless, 
one thing seems clear: the transfer of carbon 
from sedimentary rocks to the atmosphere is 
an important component of climate change, 
both past and future. = 
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Influenza’s tale 


of tails 


Epigenetics is a hot new research field, but it seems that the influenza virus 
already has it figured out. By mimicking epigenetic regulation in human cells, 
one flu strain suppresses the expression of antiviral genes. SEE ARTICLE P.428 


ALEXEI L. KRASNOSELSKY 
& MICHAEL G. KATZE 


severe, or even deadly, but most people 

recover within a few days, helped by their 
body’s immune system. Yet this apparently 
simple interaction between the virus and its 
host is deceptive. Influenza viruses are, in 
fact, masters at circumventing their hosts’ 
defences — they co-opt cellular protein- 
synthesis pathways to produce viral proteins, 
for example, and antagonize aspects of the 
immune response. On page 428 of this issue, 
Marazzi et al.' describe another viral evasion 
tactic. They show that an influenza protein 
called NS1 mimics a host-cell histone protein 
that is involved in regulating gene expression. 
By hijacking this regulatory machinery, the 
virus inhibits the cell’s production of antiviral 
proteins”. 

Cells tightly regulate the expression of 
specific genes according to an organism’s 
developmental stage and physiological state, 
and in response to environmental stimuli such 
as infection. Gene expression can be altered 
in various ways, including by transcription- 
factor proteins, which bind to DNA and 
regulate the transcription of certain genes, 
or through epigenetic modification — herit- 
able chemical changes to DNA and associated 
proteins that do not affect the DNA sequence 
but alter the rate of transcription. Epigenetics 
has rapidly become a much-studied topic 
since these regulatory mechanisms were first 
reported around 20 years ago. 

A common epigenetic alteration is the 
addition of methyl or acetyl groups to histone 
proteins, which are the ‘building blocks’ of 
chromatin, the scaffold that packages DNA 
into chromosomes. The four main histone 
proteins (H2A, H2B, H3 and H4) consist of a 
globular domain and an unstructured amino- 
terminal tail region that is dominated by the 
positively charged amino acids lysine and 
arginine. These amino acids can be modi- 
fied by enzymes that add or remove methyl 
or acetyl groups”. Such modifications alter the 
histone—DNA interaction, which in turn leads 


[=< virus infections are sometimes 


“This article and the paper? under discussion were 
published online on 14 March 2012. 


to dynamic changes in chromatin that allow 
specific areas of the chromatin to unfold, or 
other proteins to be recruited to the site. Thus, 
by regulating the accessibility of a particular 
region of DNA, epigenetic modifications can 
regulate gene expression. 

Some epigenetic changes are generally asso- 
ciated with a site of active gene transcription’. 
An example of such a change is methylation 
at the lysine residue (K) of the amino acid 
sequence designated ARTK in histone H3 
(known as the H3K4 modification). Marazzi 
et al.’ found that the H3N2 influenza virus 
contains an amino-acid sequence (ARSK) 
very similar to the histone’s ARTK sequence 
(Fig. 1). This mimic sequence is found in the 
virus’s NS1 protein, which is not essential for 
viral structure but is known to have other roles 
in evading the host’s immune system*. The 
authors also show that this sequence similar- 
ity is functional — the NS1 tail can serve as 
a substrate for the histone-modifying enzyme 
Set1, a lysine methyltransferase. 

Marazzi and colleagues also demonstrate 
that H3N2’s histone-mimic tail binds directly 
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to the transcription-elongation complex 
PAFIC (Fig. 1). PAF1C accompanies the 
transcription enzyme RNA polymerase I dur- 
ing the formation of messenger RNA, and is 
thought to recruit the Setl enzyme that leads 
to H3K4 methylation’. The authors propose 
that NS1 binding to PAFIC interferes with 
gene transcription in the host cell. Indeed, the 
researchers found that when cells from human 
lung tissue were infected with the H3N2 
virus, transcription of rapidly inducible genes 
was halted. But when the cells were infected 
with H3N2 viruses that had been mutated 
so that they could no longer bind PAFIC, 
transcription of these genes was unaffected. 
Furthermore, the authors provide compelling 
evidence that it is specifically the transcrip- 
tion of antiviral genes that is interfered with 
by the interaction between NS1’s histone- 
like tail and PAFIC. Thus, histone mimicry 
seems to support viral infection, and may 
give a selective advantage to viruses that have 
this ability. 

Is this mechanism unique to H3N2 viruses, 
and could it be a recent evolutionary adap- 
tation? Moreover, does possession of this 
mechanism have a role in the variation in 
infectivity and disease severity that is seen 
among influenza strains? The H3N2 virus is 
an emerging influenza subtype that has caused 
only a limited number of human infections 
during the 2011-12 flu season. However, 
they seem to cause illness of around the same 
severity as that induced by standard seasonal 
influenza viruses, which do not contain the 
NS1 histone-mimic tail’. It will be necessary 
to assess other emerging influenza strains for 
histone mimicry to determine whether this 
mechanism is evolutionarily new, and to 
study more influenza cases to predict its 
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Figure 1 | Interfering influenza. Histones are the building-block proteins around which DNA is wound 

to form chromatin. Epigenetic changes to histones, such as the addition of methyl groups (blue circles), can 
affect the rate of gene transcription at specific DNA sequences. a, The amino-terminal (NH,) ‘tail’ of the 
human H3 histone protein contains a string of amino acids (ARTK) that can be methylated at the lysine (K) 
residue by a methyltransferase enzyme, Set1. This enzyme is recruited by a protein complex called PAFIC, 
which also promotes elongation of RNA molecules during gene transcription. Methylation of the lysine in 
ARTK of the H3 histone tail is associated with active gene transcription. b, Marazzi and colleagues’ show that 
the NS1 protein of the H3N2 influenza virus contains an amino-acid sequence (ARSK, close to the protein's 
carboxy (COOH) terminus) that mimics the H3 ARTK sequence. Using this sequence, viral NS1 can bind 
directly to PAF1C, which might direct Set1’s methylation activity to NS1 rather than to the H3 histone. The 
authors also show that NS1 is deposited at the promoter regions of the host's DNA to which methylated H3K4 
would typically bind, and that this interference suppresses the expression of antiviral genes. 


416 | NATURE | VOL 483 | 22 MARCH 2012 


© 2012 Macmillan Publishers Limited. All rights reserved 


potential impact on public health. Even if his- 
tone mimicry is not widespread and does not 
dramatically alter disease severity, Marazzi 
and colleagues’ discovery may provide a novel 
target for antiviral drugs aimed at modulating 
the host's antiviral response. 

The finding raises another intriguing 
question: what would happen if a person is 
concurrently infected with H3N2 virus and 
a more virulent influenza virus, such as the 
highly pathogenic avian H5N1? Or what 
might occur if an H5N1 virus acquires the 
NS1 sequences required for the protein to 
function as a histone mimic? The combined 
effect of a suppressed antiviral response anda 
highly virulent virus might lead to devastating 
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consequences for the host. Alternatively, if an 
overly active host response is itself the main 
culprit in causing severe influenza, as has been 
suggested’, a suppressed antiviral response 
could actually reduce disease pathology. 

Research aimed at addressing these 
questions may be hampered by an increas- 
ing wariness about generating recombinant 
H5N1 viruses’. But ongoing influenza research 
remains essential. Although epigenetic gene 
regulation is new to scientists, other mecha- 
nisms of gene regulation no doubt remain to be 
discovered, and we can be certain that viruses 
have already evolved to interfere with these 
regulatory processes, and to use them to their 
advantage. = 


Plasmons go quantum 


Observations of collective electron waves in metal nanoparticles challenge our 
understanding of how light interacts with matter on small scales and underscore 
the need to factor quantum effects into nanophotonics. SEE ARTICLE P. 421 


F. JAVIER GARCIA DE ABAJO 


ust as water ripples travel along the 
surface of a pond after a stone is thrown 
into it, collective oscillations of conduction 
electrons called plasmons can be produced 
in a metal and propagate along its surface. 
Like pond ripples, plasmons can generally be 
explained in terms ofa classical theory — elec- 
tromagnetism. But if the metal structure on 
which they reside is very small (less than about 
10 nanometres across), quantum-mechanical 
effects emerge and become relevant, chang- 
ing the plasmons oscillation frequency and 
lifetime. Such changes have so far been poorly 
understood, particularly on the experimental 
front. On page 421 of this issue, Scholl et al.' 
describe an experiment, and an accompanying 
theoretical analysis, that addresses this issue. 
When light is shone on nanometre-sized 
particles made of silver and gold, it creates 
plasmons that have frequencies in the optical 
and near-infrared parts of the spectrum. The 
intense colours of the Lycurgus Cup, which 
dates back to fourth-century Rome and is 
on display at the British Museum in London, 
are an example of nanoparticle plasmons at 
work. Plasmons are unique in their ability to 
enhance the intensity of incident light and to 
concentrate the light down to regions much 
smaller than its wavelength. These features 
are the main reason that plasmons are finding 
use in a wide range of applications, including 
improved photovoltaic devices’, cancer ther- 
apy’ and catalysis’. 
Much effort is being invested in engineering 
small metallic structures so as to manipulate 


plasmons and optimize their performance. 
As a result, a plasmon taxonomy has been 
developed that is mostly based on the classi- 
cal electromagnetic description of structured 
metals. This classical approach is useful for 
accurately describing plasmons in structures 
larger than about 10 nm. But it fails for smaller 
structures, which are of utmost importance for 
the applications mentioned above. 


a 
Plasmon 
b 
Unoccupied 
Quantum energy 
electron levels 
transition 
Occupied 
energy 
levels 


NEWS & VIEWS | RESEARCH | 


Alexei L. Krasnoselsky and Michael G. Katze 
are in the Department of Microbiology and 
Washington National Primate Research 
Center, University of Washington, Seattle, 
Washington 98195-8070, USA. 

e-mails: akrasnos@u.washington.edu; 
honey@u.washington.edu 


1. Marazzi, |. et al. Nature 483, 428-433 (2012). 

2. Berger, S.L. Nature 447, 407-412 (2007). 

3. Ng, S.S., Yue, W. W., Oppermann, U. & Klose, R. J. 
Cell Mol. Life Sci. 66, 407-422 (2009). 

4. Garcia-Sastre, A. Virus Res. 162, 12-18 
(2011). 

5. Jaehning, J. A. Biochim. Biophys. Acta 1799, 
379-388 (2010). 

6. Kash, J. C. etal. Nature 443, 578-581 (2006). 

7. Kawaoka, Y. Nature 482, 155 (2012). 


Observing plasmons in particles smaller 
than 10nm is challenging because they inter- 
act only weakly with light. For example, their 
optical absorption is simply proportional to 
their volume. It is thus understandable that 
quantum effects emerging in such particles 
have been investigated only in ensembles*. To 
make things more complicated, collections 
of sub-10-nm particles are hard to synthesize 
with narrow distributions of particle size and 
shape, so that conventional far-field optical 
studies have relied only on measurements of 
averages over such distributions. 

Scholl and co-workers’ circumvent these 
problems by examining individual particles 
using an electron microscope. They have 
excited plasmons by focusing fast electrons on 
silver nanoparticles with sub-nanometre preci- 
sion; the electromagnetic interaction between 
fast electrons and plasmons is analogous to 
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Figure 1 | Quantum effects in nanoparticle plasmons. a, In particles larger than about 10 nanometres, 
plasmons emerge as collective oscillations of a gas of conduction electrons, and have a frequency that is 
uncertain (double-headed arrow) because of collisions among the electrons and between the electrons and 
the particles’ atomic lattice. b, In particles smaller than 10 nanometres, plasmons are associated with quantum 
electron transitions between occupied and unoccupied energy levels. As a result, the plasmon frequency and 
its uncertainty, which Scholl et al.' accurately measured, are larger than those for bigger particles. 
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the interaction of light with plasmons. Using 
state-of-the-art electron energy-loss spectros- 
copy; the authors have measured the plasmon 
frequencies with sufficient accuracy to separate 
surface- and bulk-plasmon components and 
also to resolve tiny changes in the frequency 
of surface plasmons with particle size. In clas- 
sical electrodynamics, no size dependence is 
expected. However, Scholl and colleagues have 
observed a sizeable increase (about 15%) in 
the plasmon frequency of their smallest nano- 
particles (roughly 2 nm across) compared with 
larger ones (larger than 10nm), accompanied by 
a decrease in the plasmon lifetime of about 50%. 

In a simple model, conduction electrons 
moving at about one million metres per 
second take approximately 10°“* seconds to 
travel across a 10-nm silver particle. This is 
comparable to the observed plasmon lifetime 
of such particles, which is about 10 plasmon 
periods. Under the assumption that reflections 
of the electrons at the surface contribute to the 
reduction in the plasmon lifetime, a model 
has been formulated’ that is widely used and 
accounts for the observed lifetime reduction 
in assemblies of sub-10-nm particles. But the 
increase in plasmon frequency observed by 
Scholl et al.’ in their individual sub-10-nm 
silver particles cannot be explained by this 
model, and requires quantum-mechanical 
effects to be understood in detail and carefully 
accounted for. 

In nanoparticles larger than 10 nm, the con- 
duction electrons respond to electromagnetic 
fields as a classical electron gas. In this gas, 
the contribution of each electron to the plas- 
mons cannot be observed, and the plasmon 
frequency is uncertain because of collisions 
of the electrons with each other and with the 
metals atomic lattice (Fig. 1a). 

By contrast, in nanoparticles smaller than 
10 nm, because there are relatively few con- 
duction electrons participating in the plas- 
mons (about 250 electrons in a 2-nm particle), 
the electrons appear at a discrete set of energy 
levels, which are increasingly separated from 
one another as the particle size is reduced. 
This produces individual electron transitions 
between occupied and unoccupied electron 
energy levels. The frequency jumps of these 
transitions are centred around a value that is 
higher than the plasmon frequency of large 
nanoparticles (Fig. 1b), and the finite separa- 
tion between the frequency jumps increases 
the uncertainty about the plasmon frequency 
and therefore reduces plasmon lifetime’. In 
fact, individual transitions cannot in general 
be resolved owing to the uncertainty pro- 
duced by the collisions mentioned previously. 
Using a simplified model that incorporates 
the effects of individual electron transitions, 
Scholl et al.' have successfully explained 
their data. 

By measuring the plasmon-frequency 
increase and lifetime reduction in sub-10-nm 
particles, and by modelling these effects 


in terms of quantum transitions, Scholl and 
co-workers’ have shown that the quantum 
nature of conduction electrons emerges in 
plasmons of small nanoparticles. So far, their 
study has focused on spherical nanoparti- 
cles. But it is expected that quantum effects 
should also have a role in nanometre-sized 
gaps between metallic nanoparticles and 
in sharp metal tips. These gaps and tips can 
create localized plasmon ‘hotspots’ that are rel- 
evant for applications such as nonlinear optics’ 
and ultrasensitive, single-molecule detection 
(for example, through an effect known as 
surface-enhanced Raman scattering’). 
Prospects for using plasmons to fabricate 
quantum-optics devices in robust solid- 
state systems also depend on the interaction 
between molecules and plasmon hotspots’. 
In such systems, quantum effects such as 
non-locality cannot be ignored because they 
produce sizeable plasmon-frequency shifts 
and reduced plasmon lifetimes. There are, 


NEURODEGENERATION 


of course, unknowns on the long road to 
understanding plasmons at the nanometre 
scale, but Scholl and colleagues’ study is a step 
in that direction. m 
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Trouble in the cell’s 


powerhouse 


Studies of rare hereditary disorders are intended to find treatments, but they can 
also bring other discoveries. One such study links the dysfunction of a protein to 
that of the cell’s energy producers, the mitochondria. 


DEREK P. NARENDRA & RICHARD J. YOULE 


dered’ In neurological diseases known as 

ataxias, the lack of order is mainly one of 
movement: patients cannot coordinate volun- 
tary movements because of the loss of certain 
neural circuits, making it difficult for them to 
walk, reach for objects and speak. Increasingly, 
researchers are finding’ that the underlying 
cause of some hereditary ataxias is another sort 
of disorder — that of mitochondria, the intra- 
cellular organelles that act as the cell’s power- 
houses. However, we know very little about the 
mechanisms that trigger certain other heredi- 
tary ataxias, such as the condition known as 
autosomal recessive spastic ataxia of Char- 
levoix-Saguenay, or ARSACS. The disease is 
caused by mutations in the SACS gene’, but the 
link between the mutations and the neuronal 
loss has been unclear. Writing in Proceedings of 
the National Academy of Sciences, Girard et al.* 
find that mitochondrial dysfunction is just 
such a link, and in the process provide clues 
about the cellular role of the SACS-encoded 
protein, sacsin”. 


IE Greek, ‘ataxic (atdétc) means ‘disor- 


“This News & Views article was published online 
on 7 March 2012. 
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The ataxia in ARSACS is due to a progres- 
sive loss of neurons called Purkinje cells, 
which are found in the cerebellum’, a part 
of the brain involved in muscle coordina- 
tion. Affected patients carry mutations in 
both chromosomal copies of SACS, so that 
their cells lack functional sacsin*®. The dis- 
order was first described® in 1978 in patients 
from the Charlevoix and Saguenay regions 
of Quebec, Canada, where it is much more 
common than in the rest of the world. One 
in 22 of the regions’ residents carries a muta- 
tion in one of their two copies of SACS — 
probably brought by French immigrants 
four centuries ago. 

To understand sacsin’s functions, Girard 
et al.* analysed the location of sacsin within 
various types of cells, including neurons, in cul- 
ture and observed that the protein co-localizes 
with mitochondria. Moreover, mitochondria 
in mouse cells in which sacsin expression had 
been inhibited, and in cells from patients with 
ARSACS, had markedly abnormal shapes: they 
were larger and formed more clumps than 
mitochondria in normal cells. The authors 
then generated mice that lacked the protein. 
These animals progressively lost Purkinje cells 
as they aged, making them potential models 


for studying at least some aspects of ARSACS. 
Taken together, Girard and colleagues’ results 
suggest that sacsin deficiency results in unusu- 
ally shaped, large mitochondria, and that this 
in turn leads to Purkinje-cell loss. 

The shape of mitochondria is dynamically 
controlled by their fission and fusion. But 
why do unusually shaped mitochondria harm 
Purkinje cells more than other cells? Each Pur- 
kinje cell has many long, branching projections 
that form an extensive tree-like structure 
(Fig. 1a), establishing about 150,000 synaptic 
connections with other neurons’. The synapses 
are energy-demanding because of the need for 
continual pumping of ions through the cell 
membrane — to keep the ‘lines’ clear for neural 
transmission — and thus they require func- 
tional mitochondria to be located nearby. To 
enter the narrow neuronal projections, small 
mitochondria need to be pinched off by fission 
from large ones’ (Fig. 1b). Moreover, fis- 
sion has a role in preserving mitochondrial 
health by eliminating organelles that become 
damaged by oxidative molecules produced 
during normal mitochondrial activity’. 
Purkinje cells seem to be highly dependent on 
the quality and efficient distribution of mito- 
chondria throughout the cell, because their 
robust activity carries a high oxidative bur- 
den and they have a higher number of narrow 
prolongations than other cells. Indeed, Girard 
et al.” find that the clumped, fused mitochon- 
dria in sacsin-deficient neurons fail to distrib- 
ute into the cellular prolongations (Fig. 1c), and 
that damaged mitochondria seem to be more 
abundant than in normal cells. 

The authors’ findings’ raise several ques- 
tions. For example, how does sacsin affect the 
size and distribution of mitochondria? Girard 
and colleagues propose that the protein may be 
needed for efficient fission of the organelle. In 
addition to the abnormal mitochondrial shapes 
in sacsin-deficient cells, the researchers* report 
that sacsin binds to the Drp1 protein, which 
is essential for mitochondrial division. Dur- 
ing fission, Drp1 polymerizes to form a spiral 
around the outside of the organelle, constricting 
it and facilitating its division”””. 

The hypothesis that sacsin is required for 
full Drp1 activity is consistent with previous 
research''” showing that mice with brain- 
specific Drp1 deficiency display features 
similar to those observed by Girard et al.* in 
sacsin-deficient mice — a loss of Purkinje 
cells, together with the presence of oversized 
mitochondria that fail to spread into the neu- 
ronal projections. And it has been reported® 
that damaged mitochondria are not efficiently 
degraded in cultured cells in which Drp1] activ- 
ity is inhibited. Therefore, the dependence of 
Purkinje cells on the shape and health of mito- 
chondria may explain why they are preferen- 
tially lost in Drp1- and sacsin-deficient mice, 
and in patients with ARSACS. 

Additional questions remain, such as why 
sacsin binds to Drp1 and how this binding 
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Figure 1 | Out-of-shape organelles lead to disease. a, Neurons known as Purkinje cells have many long, 
branching prolongations and are found in the cerebellum, an area of the brain that controls movement. 
These cells progressively die off in patients with a neurological disease called autosomal recessive spastic 
ataxia of Charlevoix-Saguenay (ARSACS), which is caused by mutations in the gene encoding the protein 
sacsin. b, Mitochondria are usually found throughout the body of normal neurons, including in the narrow 
cellular prolongations. c, Girard et al.’ find that mitochondria in sacsin-deficient mouse cells and in cells 
from patients with ARSACS are larger than those in normal cells and accumulate in clumps. Such large 
mitochondria do not distribute into the cellular prolongations in sacsin-lacking neurons. These results, 
together with other data from sacsin-deficient mice, suggest that mitochondrial dysfunction is involved in 


the death of Purkinje cells in patients with ARSACS. 


may affect mitochondrial shape. Some por- 
tions of the sacsin protein are similar to chap- 
erones (proteins that help other proteins to 
fold efficiently), so it is tempting to speculate 
that sacsin may be needed for Drp1 or other 
proteins involved in mitochondrial division 
to assume their proper conformation. The 
link between sacsin and Drp1 also raises the 
possibility that sacsin regulates the division 
of peroxisomes, another type of intracellular 
organelle. Drp1 elimination in mice results 
in abnormally long peroxisomes", and so 
it would be interesting to test whether the 
cells of patients with ARSACS contain longer 
peroxisomes. 

Girard and colleagues’ study of ARSACS 
strikes a common theme in research into 
neurodegenerative disorders: insight from 
clinical studies reveals new perspectives for cell 
biology, and vice versa. For instance, the study 
of families affected by optic atrophy type 1 
revealed the protein OPA] as being essential for 
mitochondrial fusion’*”’. And research in the 
fruitfly revealed that the parkin protein — the 
dysfunction of which is the leading cause of 
recessive Parkinson's disease — regulates 
mitochondrial shape”. These examples dem- 
onstrate the importance of fostering inter- 
action between the clinic and the lab. Thus, 
we can continue to learn about neurological 
disease from model organisms such as the 
fruitfly, and about mitochondrial biology 
from people in Quebec who have ataxia. With 
further collaboration between clinicians and 


scientists, this understanding might translate 
into treatments for these devastating disorders 
in the not-too-distant future. m 
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Quantum plasmon resonances of 
individual metallic nanoparticles 


Jonathan A. Scholl', Ai Leen Koh? & Jennifer A. Dionne! 


The plasmon resonances of metallic nanoparticles have received considerable attention for their applications in 
nanophotonics, biology, sensing, spectroscopy and solar energy harvesting. Although thoroughly characterized for 
spheres larger than ten nanometres in diameter, the plasmonic properties of particles in the quantum size regime 
have been historically difficult to describe owing to weak optical scattering, metal-ligand interactions, and 
inhomogeneity in ensemble measurements. Such difficulties have precluded probing and controlling the plasmonic 
properties of quantum-sized particles in many natural and engineered processes, notably catalysis. Here we 
investigate the plasmon resonances of individual ligand-free silver nanoparticles using aberration-corrected 
transmission electron microscope (TEM) imaging and monochromated scanning TEM electron energy-loss 
spectroscopy (EELS). This technique allows direct correlation between a particle’s geometry and its plasmon 
resonance. As the nanoparticle diameter decreases from 20 nanometres to less than two nanometres, the plasmon 
resonance shifts to higher energy by 0.5 electronvolts, a substantial deviation from classical predictions. We present 
an analytical quantum mechanical model that describes this shift due to a change in particle permittivity. Our results 
highlight the quantum plasmonic properties of small metallic nanospheres, with direct application to understanding and 


exploiting catalytically active and biologically relevant nanoparticles. 


The remarkable growth of nanotechnology has been driven by the ability 
to alter material properties as dimensions are reduced towards the 
atomic scale. Nanomaterials exhibit physical and chemical properties 
very different from those of their bulk counterparts, often resulting from 
enhanced surface interactions or quantum confinement. In the past 
decade, attention has focused on nanoparticles composed of noble 
metals, because they support localized surface plasmon resonances 
(LSPRs). These collective oscillations of the conduction band electrons 
enable strong optical absorption and scattering in subwavelength struc- 
tures, with spectral properties dependent on the nanoparticle’s material, 
size, shape, electronic charge and surrounding medium'”. LSPRs have 
enabled applications including photothermal ablation of cancer cells’ 
and light concentration in thin-film photovoltaics*. Furthermore, the 
sensitivity of LSPRs to changes in the particle’s environment has enabled 
ultrafine sensing, with applications including biomolecular and gas 
detection**, catalytic monitoring of reactant adsorption’”° and sensing 
of electron charge-transfer events!’. 

Although the plasmon resonances of nanospheres with diameters 
exceeding 10 nm have been well characterized’*, LSPRs of nanoparticles 
below this size are poorly understood. In this smaller size regime, 
plasmon resonances become more sensitive to the quantum nature 
of the conduction electrons. Experimentally probing this transition 
has been extremely challenging: optical detection in the far-field is 
hampered by these particles’ diminishing scattering and absorption 
intensities, which scale as the sixth and third power of a particle’s dia- 
meter, respectively'*. Furthermore, the LSPR peaks become broadened 
and damped as the dimensions of the particle decrease well below the 
electron mean free path (52 nm for silver’). The difficulty in observing 
such weak signals has led to conflicting results from experiments 
involving quantum-sized plasmonic particles, with multiple reports 
of LSPR ‘redshifts’ or ‘blueshifts’ as the sphere dimensions are 
reduced'’. For example, ensemble studies of glass-embedded silver 
spheres with sizes ranging from 10nm to 2nm have indicated a 


plasmon resonance blueshift of 0.1 eV (ref. 16). In contrast, ensembles 
of monodisperse silver nanospheres stabilized with oleylamine ligands 
have indicated significant LSPR redshifts of 0.3eV (from 3.1 eV to 
2.8 eV) as the mean particle diameter decreases from 18 nm to 2nm 
(ref. 17). 

To eliminate ensemble heterogeneity effects, other recent experiments 
have examined single metallic nanoparticles. Confocal heterodyne’* and 
photothermal’? techniques were used to observe the absorption of nano- 
particles as their diameter decreased. Citrate-stabilized gold spheres with 
diameters ranging from 30 nm to 5nm have exhibited a plasmon-res- 
onance blueshift of approximately 0.05 eV, indicating a transition to 
quantum-confined effects. However, direct correlation between indi- 
vidual particle geometry, size and spectra was not achieved because 
the specimens were substantially below the optical diffraction limit. 

With such difficulty in experimentally determining the plasmonic 
properties of small metallic particles, theoretical models to explain the 
nanospheres’ deviations from bulk permittivity values have been 
equally challenging to develop and validate. Classical approaches have 
included the addition of size-dependent electron scattering terms in 
the Drude permittivity model, but the use of these alone results in 
predictions of redshifts and imperceptible plasmon resonances at 
small size scales’* that conflict with experimental findings'’*'*"”. Ab 
initio density functional theory (DFT) approaches have also been used 
to predict metallic cluster properties*”*', but are currently limited to 
particles smaller than 1-2 nm, owing to their high computational 
demands”. 

If the plasmon resonances of such quantum-sized particles could be 
understood and used, they would offer unprecedented opportunities in 
biology, catalysis and quantum optics. For example, their reduced 
dimensions would enable improved nanoparticle integration into cells 
for bio-plasmonic imaging or therapeutics***°. Simultaneously, their 
increased surface-to-volume ratios would make them ideal catalytic 
centres or sensors*’. Quantum-sized plasmonic particles would be 
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particularly sensitive to single-molecule adsorbates® or to few-photon 
or few-electron transfer events’*, with potentially profound applica- 
tions for quantum information transfer’. 

In this Article, we examine the plasmonic properties of individual 
nanospheres with diameters ranging from 20 nm to less than 2 nm, 
extending from the classically described regime to the quantum size 
regime. Rather than relying on optical observation, we use aberration- 
corrected TEM imaging in conjunction with monochromated scanning 
transmission electron microscopy (STEM) EELS. Combined, these 
techniques allow direct morphological and spectral analysis of indi- 
vidual nanoparticles, addressing many of the ambiguities of previous 
investigations. STEM EELS has recently yielded unprecedented insight 
into the plasmonic resonances of nanoscale hemispheres*® as well as 
larger spheres”! and prisms”. Here, we explore the plasmon modes of 
silver nanoparticles as their size is reduced to dimensions well below the 
electron mean free path. To minimize any surface chemical interface 
effects, the particles are synthesized without ligands. Our experimental 
results indicate a significant 0.5-eV blueshift in the plasmon resonance 
as the particle size approaches the quantum regime, with only slight 
linewidth broadening. The results indicate an increasingly substantial 
change in nanoparticle permittivity as the diameter is reduced below 
10 nm, which can be well described with a computationally light 
analytical model based on fundamental quantum mechanical 
principles. 


Nanoparticle synthesis and electron spectroscopy 

Figure 1 presents high-resolution micrographs of our particles. The 
Ag nanospheres were synthesized in solution using silver nitrate as the 
metallic precursor and sodium borohydride as both the reducing 
agent and electrostatic stabilizer to prevent agglomeration (see 
Methods)**. The absence of surface ligands on the silver nanoparticles 
is important to reduce the chemical interface effects on the plasmon 
resonance, but does present some synthetic challenges: ligands are 
typically used to prevent particle aggregation and reduce surface 
energy to counterbalance thermodynamic driving forces, allowing 
particle size to be more directly controlled. In this synthesis, electro- 
static repulsion is used as a substitute for surface ligands to prevent 
agglomeration. The resulting silver nanoparticles have a mean dia- 
meter of 8 nm with sizes ranging from 1 nm up to 20 nm. They retain 
spherical-like symmetry for diameters just below 2 nm. 


Spectroscopic analysis of our silver nanoparticles was performed with 
a FEI Titan transmission electron microscope at 300 kV in STEM mode, 
with an imaging spatial resolution of 0.27 nm, a spectral dispersion of 
0.01 eV per pixel, and an EELS zero-loss peak (ZLP) full-width at half- 
maximum of 0.15 eV. Particles were deposited and well-dispersed on 
ultrathin (3 nm) carbon films to minimize the signal from the sub- 
strate during EELS. As a control, we also investigated the plasmon 
resonances of nanoparticles on 8-nm-thick SiO, substrates, which 
yielded very similar results (see Supplementary Information). 

During EELS collection, electrons that interact with the particle 
exhibit an energy loss corresponding to the electromagnetic modes 
of the structure*”**”*”. However, not all incident electrons will excite 
plasmons. In fact, the vast majority of source electrons are able to pass 
through the sample with minimal energy loss, resulting in a dominant 
ZLP on the detector. Owing to its relative magnitude and intrinsic 
Gaussian-like shape, the ZLP contributes a substantial background 
spanning several electronvolts in energy loss. To extract the plasmon 
contribution, techniques must be implemented either to deconvolve 
the ZLP from the collected spectra or to subtract the signal entirely. 
The former was accomplished through the Richardson-Lucy de- 
convolution algorithm, using the spectra of the substrate alone as the 
point spread function and deconvolving it from the particle spectra”. 
Alternatively, the ZLP subtraction was accomplished through two 
techniques: the ‘reflected tail’ method and Gaussian plus Lorentzian 
ZLP curve fitting (see Methods)**. Each of the above three strategies 
separately yielded consistent results in the plasmonic spectra. 

Multiple energy loss spectra were collected for each particle, both 
near the sphere’s edge to observe the surface plasmon resonance 
modes and through its centre to observe the bulk modes. When the 
electron beam is directed in close proximity to the edge of a particle, 
the surface electrons collectively oscillate and the LSPR modes are 
excited*’. The energy of this resonance is dependent on not only 
the material’s permittivity function but also its size, shape and 
surrounding medium. The bulk plasmon resonance, in contrast, 
is the collective oscillation of all the electrons in the material. It 
indicates the frequency of a metal-to-dielectric transition and is 
affected solely by the material’s permittivity. Although the higher- 
energy bulk resonance peak can be observed with electron excitation, 
it cannot be seen in optical measurements, owing to the symmetry of 
the mode”. 


Figure 1| Aberration-corrected TEM images of silver nanoparticles synthesized free of stabilizing ligands. Particles with diameters of 2 nm (a), 3nm 
(b), 4.5 nm (c), 6nm (d), 7.5 nm (e), 9nm (f), 10.5 nm (g), and 12 nm (h) are shown. Scale bars: a-d, 2 nm; e-h, 5 nm. 
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First, we used STEM EELS to measure the plasmon resonances 
of larger, classically described nanoparticles that have been well 
characterized*. Figure 2 illustrates a 20-nm-diameter nanosphere 
that was excited with the electron beam along the edge and through 
the bulk of the particle. Both the STEM-acquired particle image and 
spectra are included. As can be seen, exciting the particle at the edge 
(position (i)) results in a single spectral peak at 3.3 eV, corresponding 
to the particle’s LSPR. Electron excitation near the particle’s interior 
(positions (ii) and (iii)) results in development of a second spectral 
peak at 3.85eV, corresponding to the bulk plasmon of silver. The 
spectral positions of the surface and bulk plasmon resonances remain 
unchanged with the location of the electron source. However, the 
relative weight of the peaks varies, corresponding to the efficiency 
with which each mode is excited. 

We repeated these techniques for individual nanoparticles with 
diameters ranging from 20 nm down to 1.7 nm. Figure 3a illustrates 
the STEM image and energy loss spectra of six individual particles 
with diameters of 11 nm, 8.5 nm, 5.5nm, 3.5 nm, 2.5 nm and 1.7 nm. 
These spectra were generated by directing the electron beam at the 
edge of the particles to excite the LSPR selectively. As the nanosphere 
diameter is decreased, several trends can be noted. First, signal 
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Figure 2 | STEM image of a 20-nm-diameter silver particle and the 
associated deconvoluted EELS data. Spectra were collected by directing the 
electron beam to different locations on the particle, from the edge (i) through 
the bulk (iii). Based on the excitation location, surface or bulk resonances can be 
selectively observed. 
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strength from larger particles is stronger than from smaller spheres, 
as would be expected on the basis of particle volume. However, even 
with the smallest observed particles, the plasmon resonance is still 
significantly above the noise level. Second, the spectral peak widths do 
show some expansion as particle size is decreased, with a full-width at 
half-maximum increasing from 0.4eV to 0.6eV as the diameter is 
reduced from 11 nm to 1.7 nm. Nevertheless, this broadening is sub- 
stantially less than would be expected from ensemble measurements 
for similarly sized small particles, where the resonance peak becomes 
so dampened as to be nearly unrecognizable™. Finally, and most sig- 
nificantly, the energy of the surface plasmon resonance exhibits a 
notable increase as the particle size decreases. 

Figure 3b plots this plasmon resonance energy shift over a collec- 
tion of particle sizes, with surface resonance peaks increasing from 
3.3 eV to 3.8 eV as the particle size decreases. In contrast, classical Mie 
theory would predict only a 0.03-eV blueshift over the same size 
range. Interestingly, the trend towards higher energy resonances is 
not purely monotonic in the EELS data, with a greater variety in peak 
locations occurring at the smallest sizes. 

In Fig. 3b, two standard deviation (95%) confidence intervals are 
included, generated by analysing the particle’s spectra and fitted peak 
location data with a bootstrapping technique (see Methods). The 
confidence intervals indicate that the observed surface resonance peak 
shift is indeed real and is due to alterations in the permittivity as the 
particles enter the quantum size regime. 

Figure 3b also includes a plot of the bulk plasmon shift with 
decreasing particle diameter, generated by directing the electron 
source through the particle’s interior. For large particle dimensions, 
the bulk plasmon peak remains unaltered around 3.85 eV. For dia- 
meters smaller than 10 nm, however, a slight blueshift of 0.1 eV can be 
noticed. Bulk plasmon resonances distinct from LSPRs were observed 
for particles as small as 6 nm in diameter. Smaller particles exhibited a 
nearly indistinguishable spectral response for electron excitation both 
at the edge and at the centre of the particle. As shown below, this 
small-particle response can ostensibly be assigned to the LSPR. 


Modelling the classical-to-quantum transition 

To understand the EELS data of Figs 2 and 3, we have developed an 
analytic model to describe size-dependent changes in plasmon reso- 
nances. The model is rooted in quantum calculations of the metal’s 
electric permittivity ¢ as particle dimensions are reduced towards the 
atomic scale. Though completely analytic, the results are comple- 
mentary to both ab initio DFT calculations*’ and recent developments 
in predicting a system’s electromagnetic response with nonlocal 
dielectric functions*”’. 

According to the quasistatic approximation of Mie theory for small 
particles’***, the LSPR peak occurs when the particle’s polarizability is 
maximized. For a spherical particle, this condition is satisfied when 

Eparticle + 2ernediaeal is minimized. The bulk plasmon resonance occurs 
when the real component of the metal permittivity crosses zero. Thus, 
the spectral properties of plasmonic nanoparticles depend very sensi- 
tively on their dielectric function. 

Classically, the frequency-dependent permittivity of metals can be 
described with a Drude model which determines the material’s dielectric 
response by considering the motion of the free electrons against a 
background of positive ion cores. The general Drude permittivity is 
expressed as: 
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where @ is the angular frequency of the light, «, is the plasma fre- 
quency, a material property that is based on the number density of 
conduction electrons, é,, is a corrective constant that accounts for the 
background electron screening at high frequency, and y represents the 
scattering frequency of the electron as it travels through the metal. To 
increase the energy range across which the Drude model matches the 
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Figure 3 | Correlating Ag nanoparticle geometry with plasmonic EELS data. 
a, Collection of normalized, deconvoluted EELS data from particles ranging 

from 11 nm to 1.7 nm in diameter and the corresponding STEM image of each 
specimen. The electron beam was directed onto the edge of the particles so that 
only the surface resonance is shown. b, Plot of the surface plasmon resonance 


empirical literature data**, we replace the corrective constant ¢,, with 
the frequency-dependent term é;,. This component accounts for the 
contribution of d-band valence electrons that undergo an interband 
transition at high energies (see Supplementary Information). 
Although Drude-like permittivity models accurately predict the 
plasmon spectra for larger nanoparticles (diameters greater than 
10 nm), error significantly increases for reduced particle sizes. For 
nanostructures composed of only a few hundred to a few thousand 
atoms, both increased surface interactions and quantum confinement 
will affect the motion of electrons. Surface interactions will cause the 
scattering frequency y to increase owing to the increased number of 
electron collisions. Existing classical models of small-particle Drude- 
based permittivities have described this change with the expression: 


Y=Youlk + <s (2) 


where R is the particle radius and vp is the Fermi velocity (approxi- 
mately 1.4 10°ms ' in silver‘S). The variable A is a fitting co- 
efficient whose value is still much debated in the literature, spanning 
an order of magnitude from 0.1 to 2 depending on the experimental 
data and physical model'****”. The discrepancies in reported values for 
this coefficient may indicate that by itself, the classical surface scatter- 
ing approach is insufficient to model the plasmon resonances of 
quantum-sized particles accurately. In fact, in contrast to our experi- 
mental results, solely increasing the y value with decreasing particle size 
results in a slight redshift of the plasmon resonance™. 

To fully model small plasmonic particles (diameters of 2-10 nm), 
quantum mechanical effects must also be considered. For such particles, 
the energy levels in the conduction band become discretized and only 
certain electronic or plasmonic transitions are allowed. To account for 
these transitions, we start with the Drude model of equation (1), but 
include Lorentzian terms that can be defined quantum mechanically. 
Following the example of Genzel’® and Kraus*’, we model the conduc- 
tion electrons as a free electron gas constrained by infinite potential 
barriers at the physical edges of the particle. Transition frequencies «j- 
thus correspond to the allowed quantum energies of transitions of 
conduction electrons from occupied states i within the k-space Fermi 
sphere to unoccupied states f immediately outside it. 
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energy versus particle diameter, with the inset depicting bulk resonance 
energies. Horizontal error bars indicating 95% confidence intervals were 
generated with a curve fitting and bootstrapping technique (see Methods). 
Vertical error bars are contained within the size of the data points. 


The relative magnitude of each of these Lorentzian perturbations is 
based on the well-established Thomas—Reiche-Kuhn energy-weighted 
sum rule (also known as the f-sum rule) that are commonly used for 
absorption processes. Encapsulating significant information about the 
energy spectrum and eigenfunctions of a system in a compact form, 
they have enabled experimental confirmation of early quantum theory 
and have since been widely used in solid-state physics as well as atomic 
and nuclear spectroscopy”. Here, the technique is employed in the 
form of oscillator strengths (Sj) that correspond to each transition 
frequency. 

In total, the particle permittivity can be expressed as: 


Si 
2 if 

3 
&(@) ae) Sp rs meres (3) 


where the sum is taken over all initial and final states of the electrons. 
The frequency of the transitions from the occupied to excited states 
can be described as: 
Ey —E; 
oa (4) 
where the energy levels Eyand E; depend on the geometry and poten- 
tial of the system. We assume that the nanocrystal’s physical shape 
closely approximates a sphere and thus treat the conduction electrons 
as particles in an infinite spherical well: 
Wr 
~ 8MR? 
Here, M is the mass of the electron, and n and / are the principal and 
azimuthal quantum numbers respectively. (This assumption of a 
‘spherical well’ potential is justified on the basis of the geometry of 
the particles seen in Fig. 1.) 
The oscillator strength terms, as dictated by the Thomas—Reiche- 
Kuhn sum rule, collectively sum to unity and are described using the 
standard harmonic oscillator quantum mechanical definition: 
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where N is the number of conduction electrons in the nanoparticle. 
The matrix element term <f|z|i> is determined by the allowed 
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wavefunctions of the spherical well model. Further details can be 
found in the Supplementary Information. 

Figure 4a illustrates the real and imaginary components of the 
permittivity, accounting for both quantum mechanical effects and 
surface scattering. Here, the electron scattering term y was allowed 
to vary in accordance with equation (2), using A = 0.25. This value 
was chosen based on fits from single-particle experiments of larger 
spheres on glass substrates’? and DFT simulations of silver nano- 
particles®, both of which are free of the complicating effects of het- 
erogeneity in ensemble measurements. 

As can be seen, the permittivity is highly dependent on the specific 
size of the spherical particle. For small particle diameters (<10 nm), 
the Lorentzian oscillations occur at higher energies as a result of the 
allowed electron transition frequencies. These have an R-* depend- 
ence according to the allowed energies in a spherical well (equation 
(5)). As the particle diameter increases beyond the quantum size 
range (>10nm), the conduction electron transition frequencies 
decrease into the infrared, and the magnitude of the perturbations, 
as dictated by the oscillator strength, also decrease. Accordingly, 
larger particle permittivities converge to bulk experimental data and 
the classical Drude model based on bulk parameters (see Fig. 4). The 
choice ofa larger A coefficient would result in increased plasmon peak 
broadening, but does not significantly affect the general energy trends. 

Using the permittivity values shown in Fig. 4a as an input to elec- 
tromagnetic Mie theory, the absorption efficiencies (absorption cross- 
sections divided by physical cross-sections) of spheres with diameters 
of 2, 4, 6 and 8 nm are determined. The energies of the dipolar reso- 
nances that dominate the particles’ optical spectra are the same as 
when generated through EELS**”®, allowing direct comparison with 
our experimental data (see Supplementary Information). Figure 4b 
illustrates the results, revealing decreasing absorption efficiency and 
blueshifting LSPR peaks as particle size decreases that agree with 
experimental findings. 

The full trend of calculated LSPR energy versus particle diameter is 
shown in Fig. 5a. Particle diameter values are displayed in gradations 
of full metallic monolayers (each adding approximately 0.5 nm), and 
the corresponding absorption spectra for each size is normalized to 
the same intensity colour scale. An effective medium refractive index 
of n = 1.3 is used as a weighted average of the carbon substrate (n = 2) 
and the surrounding vacuum (n = 1). The precise value of the index 
was determined by fitting the theory to the resonances of the largest 
particles (diameters about 20 nm) outside the quantum size regime. 
For comparison at smaller sizes, the full range of experimental EELS 
data was then overlaid, showing excellent agreement with the analytic 
quantum model. Both empirical values and calculations point to 
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quantum size effects beginning to appear at diameters below 10 nm, 
resulting in a strong blueshift of the LSPR peaks compared to classical 
Mie theory predictions (dashed line). 

The increased variability in experimental resonance energies in this 
quantum size regime additionally matches the non-monotonic trend 
in theoretical absorption. In the model, these perturbations are caused 
by higher-order conduction electron transitions (from deeper within 
the Fermi sphere to immediately outside it). Although these reso- 
nances have a weaker influence on the permittivity than the lowest- 
order transitions that dominate in Fig. 4a, their effects can be more 
pronounced for particles of certain diameters: when the radius- 
dependent transition frequencies overlap the overall dipolar 
resonance condition of the particle in its environment (that is, 
| Eparticle + 2émedium| is minimized), additional spectral resonance 
peaks may appear and result in apparent shifts in the LSPR. These 
higher-order transitions have an increasing role for smaller particles 
because their oscillator strengths are relatively greater than those of 
larger spheres. 

Our analytic model provides a straightforward method of under- 
standing changes in the metal permittivity as nanomaterials approach 
the quantum size regime. It remains a relatively coarse approximation, 
however, because three simplifying assumptions are necessary to make 
the model tractable. First, the wavefunctions and allowed energies are 
based on an infinite potential barrier model. In experimental systems 
that involve colloidal particles with stabilizing ligands, the barrier may 
behave in a more finite manner with spill-out effects'*. The nano- 
spheres used in this experiment, however, were synthesized without 
ligands to minimize this phenomenon. Second, the quantum model 
assumes an essentially spherical particle, which becomes increasingly 
difficult to synthesize at sizes below 1.5 nm when there are fewer than a 
hundred atoms. For this reason, we limit the present model and experi- 
mental study to particles larger than 1.5 nm in diameter. However, it 
should be noted that high-symmetry structures approximating spheres 
can be attained at certain ‘magic number’ collections of atoms even 
down to the smallest cluster sizes*’. Lastly, the model assumes that 
there is no interaction between the conduction electrons, a common 
assumption in many analytical quantum models. For a complete 
understanding of nanoparticle properties, ab initio techniques are 
required. 


Comparing analytic and ab initio predictions 

As a means of comparison, we have also used Ag permittivities 
derived from a rigorous DFT model employing a local density 
approximation” to calculate plasmon resonances. In ref. 39, the 
particle dielectric functions were generated through the analysis of 
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Figure 4| Analytic quantum theory of particle permittivity and spectra. 
a, The real (¢,) and imaginary (2) components of the dielectric functions of 
silver nanoparticles with diameters of 2, 4, 6 and 8 nm, calculated using the 
analytic quantum model of equation (3). b, The corresponding absorption 
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spectra of the particles generated by Mie theory. The refractive index of the 
surrounding medium was set to 1.3. As particle size decreases, reduced 
absorption efficiencies and blueshifted surface plasmon peak energies are 
predicted. 
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Figure 5 | Comparison of experimental data with quantum theory. 
Experimental, EELS-determined localized surface plasmon resonance energies 
of various Ag particle diameters are overlaid on the absorption spectra 
generated from the analytic quantum permittivity model (a) and the DFT- 
derived permittivity model (b). The experimental bulk resonance energies are 


high-symmetry silver clusters spanning from Ag); to Agsge. 
Interband (d-band) and intraband (conduction) electron contribu- 
tions were separated, and it was shown that interband effects on 
permittivity remain consistent with those seen in bulk while the particles 
exceed 2 nm in diameter, thus validating our use of a constant é;p term in 
the analytic model. In ref. 39, the intraband contributions to the DFT- 
calculated permittivities were smoothed by single Gaussians to fit the 
relevant Drude model parameters, including the plasma frequency p, 
scattering frequency y and a single resonance frequency wi. Given that 
particle sizes larger than Agsge¢ (diameter 2.7 nm) become increasingly 
computationally demanding using DFT techniques, the trends in these 
parameters were extrapolated to bridge the quantum to classically 
described size span. 

Using this DFT permittivity model and inserting the values into 
Mie theory, we similarly generated the absorption efficiency spectra 
for comparison to our experimental EELS data, as seen in Fig. 5b. The 
empirical and DFT-derived resonances again exhibit excellent agree- 
ment, with blueshifts of 0.5eV observed as the particle decreases 
below 10 nm in diameter. It is additionally worth noting that when 
only the lowest-order electron transitions are allowed in our quantum 
analytic approach, it presents a trend nearly identical to that seen in 
the DFT-derived model (see Supplementary Information). 

Predictions of the bulk plasmonic resonance energies using the 
quantum analytic and DFT-derived permittivity data have also been 
calculated by monitoring the frequency of the metal-to-dielectric 
transition (where ¢, = 0). Note that this bulk frequency differs from 
the plasma frequency , that is calculated by the simple Drude model, 
owing to the additional influence of the interband transitions on the 
permittivity. 

The theory data for the bulk frequency, plotted as lines in Fig. 5a and 
b, shows strong consistency with the experimental values. Both show a 
blueshift for particles below 10 nm because the permittivity is altered 
by quantum size effects. We note that the predicted bulk resonance 
energy is greater than that of the corresponding surface resonance at all 
particle sizes throughout the modelled range. As the sphere diameter 
decreases, however, the LSPR frequency experiences a greater shift and 
the two peaks become harder to distinguish empirically. Additionally, 
with these smallest-sized particles, the relative surface-to-volume ratio 
greatly increases and the probability of bulk excitation decreases 
through a documented begrenzung effect*****’. We therefore predict 
that the surface resonance peak will dominate the peak from the bulk. 
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also included (grey dots) along with the theory prediction (grey line). Classical 
Mie theory peak prediction is given by the dashed white line. The experimental 
data begin to deviate significantly from classical predictions for particle 
diameters smaller than 10 nm. Horizontal error bars represent 95% confidence 
intervals, as calculated through curve fitting and bootstrapping techniques. 


Discussion 


We have, for the first time, systematically observed and thoroughly 
analysed the plasmonic properties of individual silver nanospheres 
with sizes entering the quantum size regime. Using STEM EELS in 
an aberration-corrected electron microscope, we correlate the 
exact nanoparticle size and geometry with its spectra for colloidally 
synthesized particles devoid of the surface damping effects of ligands. 
A substantial blueshift in LSPR resonances is observed as particle size 
decreases, indicating a transition into the quantum-influenced 
regime. We develop an analytical quantum model to describe these 
resonances, which, despite its approximations and relative simplicity, 
agrees remarkably well with DFT methods. This straightforward 
theoretical framework can be directly applied to other metals with 
only the adjustment of the bulk material parameters )4,, @p and éjp. 
Additional particle geometries such as rectangular prisms, cylinders 
and spherical shells can also be analysed through modification of the 
analytic quantum wavefunctions”. 

The combination of this versatile analytic model with direct plasmon 
observation in STEM EELS presents numerous opportunities for 
exploiting the plasmonic properties of quantum-sized particles. We 
anticipate a number of in situ STEM EELS experiments on both metallic 
and semiconducting plasmonic quantum dots”. In particular, real-time 
EELS monitoring of quantum plasmon resonances may yield dynamic 
insight into multi-electron redox reactions catalysed by small metallic 
particles, such as in artificial photosynthesis. Moreover, these quantum 
plasmon resonances may enable novel quantum devices based on 
excitation and detection of single photons or plasmons. Ultimately, 
EELS of quantum plasmonic systems should facilitate new understand- 
ing of electronic-scale processes, powerfully complemented by the 
atomic-scale resolution of aberration-corrected TEM. 


METHODS SUMMARY 


Ligand-free silver nanospheres were synthesized according to a previously 
described technique. 5ml of 1mM AgNO, (>99% Alfa Aesar) were added 
dropwise over 3 min to 15 ml of 2mM NaBH, (Fisher Scientific) that had been 
chilled in an ice bath. The sodium borohydride solution was stirred during the 
silver addition to ensure a homogeneous distribution. 

Particles were dispersed on either ultrathin carbon or silicon dioxide sus- 
pended films. Spectra were collected by directing the electron beam onto the edge 
and centre of the particle, with ten acquisitions of 30 ms each to prevent satura- 
tion of the charge-coupled device (CCD) detector. These collections were aligned 
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according to their ZLP and summed in post-processing. To reduce the back- 
ground from the dominant ZLP and improve the energy resolution, the 
Richardson-Lucy deconvolution algorithm was employed. The point spread 
function, which indicates the energy spread of the electron source and aberrations 
in the spectrometer and CCD, was estimated by the acquisition of ten aligned 
spectra from the nearby substrate alone. 

ZLP subtraction can also be used to reveal the plasmon resonance and was 
accomplished through two different techniques. In the first, ‘reflected tail’ 
method, the negative tail of the ZLP was reflected about the 0-eV axis and sub- 
tracted from the positive side*’. In the second method, the ZLP was fitted to the 
sum of a Gaussian and Lorentzian curve and subtracted from the spectra. We 
separately applied each ZLP-removal method, which yielded consistent results in 
the plasmonic spectra. 

Confidence intervals for resonance peak energies were generated using a boot- 
strapping method in which a locally weighted scatterplot smoothing (LOWESS) 
algorithm was applied to the summed EELS data of each particle. Each curve- 
fitting relies on a randomly selected subset of the spectral data, producing slight 
variations in the calculated peak energy. Using 20 iterations of the algorithm, 95% 
confidence intervals could be generated. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 12 October 2011; accepted 26 January 2012. 


1. Willets, K. A. & Van Duyne, R. P. Localized surface plasmon resonance 
spectroscopy and sensing. Annu. Rev. Phys. Chem. 58, 267-297 (2007). 

2. Juluri, B. K., Zheng, Y. B., Ahmed, D., Jensen, L. & Huang, T. J. Effects of geometry 
and composition on charge-induced plasmonic shifts in gold nanoparticles. 

J. Phys. Chem. C 112, 7309-7317 (2008). 

3. Lal, S., Clare, S. E. & Halas, N. J. Nanoshell-enabled photothermal cancer therapy: 
impending clinical impact. Acc. Chem. Res. 41, 1842-1851 (2008). 

4. Catchpole, K. R. & Polman, A. Design principles for particle plasmon enhanced 
solar cells. Appl. Phys. Lett. 93, 191113 (2008). 

5. Bingham, J.M., Anker, J. N.& Kreno, L. E. Gas sensing with high-resolution localized 
surface plasmon resonance spectroscopy. J. Am. Chem. Soc. 132, 17358-17359 
(2010). 

6. Anker, J. N. et a/. Biosensing with plasmonic nanosensors. Nature Mater. 7, 
442-453 (2008). 

7. Stiles, P.L., Dieringer, J. A., Shah, N.C. & Van Duyne, R. P. Surface-enhanced Raman 
spectroscopy. Ann. Rev. Anal. Chem. 1, 601-626 (2008). 

8. Jeanmaire, D.L.& Van Duyne, R.P. Surface Raman spectroelectrochemistry. Part|. 
Heterocyclic, aromatic, and aliphatic amines adsorbed on the anodized silver 
electrode. J. Electroanal. Chem. Interf. Electrochem. 84, 1-20 (1977). 

9. Larsson, E. M., Langhammer, C., Zoric, |. & Kasemo, B. Nanoplasmonic probes of 

catalytic reactions. Science 326, 1091-1094 (2009). 

0. Liu, N., Tang, M. L, Hentschel, M., Giessen, H. & Alivisatos, A. P. Nanoantenna- 

enhanced gas sensing in a single tailored nanofocus. Nature Mater. 10, 631-636 

(2011). 

1. Novo, C., Funston, A.M. & Mulvaney, P. Direct observation of chemical reactions on 

single gold nanocrystals using surface plasmon spectroscopy. Nature 

Nanotechnol. 3, 598-602 (2008). 

2. Link, S. & El-Sayed, M. A. Spectral properties and relaxation dynamics of surface 

plasmon electronic oscillations in gold and silver nanodots and nanorods. J. Phys. 
Chem. B 103, 8410-8426 (1999). 
3. Bohren, C.F. & Huffman, D. R. Absorption and Scattering of Light by Small Particles 
(Wiley, 1983). 

4. Kreibig, U. & Vollmer, M. Optical Properties of Metal Clusters (Springer, 1995). 

5. Kreibig, U. & Genzel, L. Optical absorption of small metallic particles. Surf. Sci. 156, 
678-700 (1985). 

6. Genzel, L., Martin, T. P. & Kreibig, U. Dielectric function and plasma resonances of 
small metal particles. Z. Phys. B 21, 339-346 (1975). 

7. Peng, S.,McMahon, J. M., Schatz, G. C., Gray, S. K. & Sun, Y. Reversing the size- 

dependence of surface plasmon resonances. Proc. Nat! Acad. Sci. USA 107, 

14530-14534 (2010). 

8. Lindfors, K., Kalkbrenner, T., Stoller, P. & Sandoghdar, V. Detection and 

spectroscopy of gold nanoparticles using supercontinuum white light confocal 

microscopy. Phys. Rev. Lett. 93, 037401 (2004). 

9. Berciaud, S., Cognet, L., Tamarat, P. & Lounis, B. Observation of intrinsic size effects 

in the optical response of individual gold nanoparticles. Nano Lett 5, 515-518 

(2005). 

20. Bakr, O. M. et al. Silver nanoparticles with broad multiband linear optical 
absorption. Angew. Chem. 48, 5921-5926 (2009). 

21. Zhu, M., Aikens, C. M., Hollander, F. J., Schatz, G. C. & Jin, R. Correlating the crystal 
structure ofa thiol-protected Au25 cluster and optical properties. J. Am. Chem. Soc. 
130, 5883-5885 (2008). 

22. Sholl, D.S. & Steckel, J. A. Density Functional Theory: A Practical Introduction (Wiley- 
Interscience, 2009). 


ARTICLE 


23. Cognet, L. et al. Single metallic nanoparticle imaging for protein detection in cells. 
Proc. Natl Acad. Sci. USA 100, 11350-11355 (2003). 

24. Hu, M. etal. Gold nanostructures: engineering their plasmonic properties for 
biomedical applications. Chem. Soc. Rev. 35, 1084-1094 (2006). 

25. Jain, P. K., Huang, X., El-Sayed, |. H. & El-Sayed, M. A. Noble metals on the 
nanoscale: optical and photothermal properties and some applications in 
imaging, sensing, biology, and medicine. Acc. Chem. Res. 41, 1578-1586 (2008). 

26. Sperling, R.A., Gil, P.R., Zhang, F., Zanella, M. & Parak, W. J. Biological applications 
of gold nanoparticles. Chem. Soc. Rev. 37, 1896-1908 (2008). 

27. Templeton, A. C., Pietron, J. J., Murray, R. W. & Mulvaney, P. Solvent refractive 
index and core charge influences on the surface plasmon absorbance of 
alkanethiolate monolayer-protected gold clusters. J. Phys. Chem. B 104, 564-570 
(2000). 

28. Hirakawa, T. & Kamat, P. V. Charge separation and catalytic activity of Ag@TiO2 
core-shell composite clusters under UV-irradiation, J. Am. Chem. Soc. 127, 
3928-3934 (2005). 

29. Akimov, A. V. et al. Generation of single optical plasmons in metallic nanowires 
coupled to quantum dots. Nature 450, 402-406 (2007). 

30. Ouyang, F., Batson, P. E. & Isaacson, M. Quantum size effects in the surface- 

plasmon excitation of small metallic particles by electron-energy-loss 

spectroscopy. Phys. Rev. B 46, 15421-15425 (1992). 

31. Koh, A. L. et a/. Electron energy-loss spectroscopy (EELS) of surface plasmons in 

single silver nanoparticles and dimers: influence of beam damage and mapping of 

dark modes. ACS Nano 3, 3015-3022 (2009). 

32. Nelayah, J. et al. Mapping surface plasmons on a single metallic nanoparticle. 

Nature Phys. 3, 348-353 (2007). 

33. Solomon, S. D. etal. Synthesis and study of silver nanoparticles. J. Chem. Educ. 84, 

322-325 (2007). 

34. Ritchie, R. H. Plasma losses by fast electrons in thin films. Phys. Rev. 106, 874-881 
(1957). 

35. Ferrell, T. L. & Echenique, P. M. Generation of surface excitations on dielectric 
spheres by an external electron beam. Phys. Rev. Lett. 55, 1526-1529 (1985). 

36. Garcia de Abajo, F.J. Optical excitations in electron microscopy. Rev. Mod. Phys. 82, 
209-275 (2010). 

37. Egerton, R. F. Electron energy-loss spectroscopy in the TEM. Rep. Prog. Phys. 72, 
016502 (2009). 

38. Gloter, A, Douiri, A. Tence, M. & Colliex, C. Improving energy resolution of EELS 
spectra: an alternative to the monochromator solution. Ultramicroscopy 96, 
385-400 (2003). 

39. He, Y. & Zeng, T. First-principles study and model of dielectric functions of silver 
nanoparticles. J. Phys. Chem. C 114, 18023-18030 (2010). 

40. Garcia de Abajo, F. J. Nonlocal effects in the plasmons of strongly interacting 
nanoparticles, dimers, and waveguides. J. Phys. Chem. C 112, 17983-17987 
(2008). 

41. McMahon, J. M., Gray, S. K. & Schatz, G. C. Nonlocal optical response of metal 
nanostructures with arbitrary shape. Phys. Rev. Lett. 103, 097403 (2009). 

42. David, C. & Garcia de Abajo, F. J. Spatial nonlocality in the optical response of metal 
nanoparticles. J. Phys. Chem. C 115, 19470-19475 (2011). 

43. Mie, G. Beitrage zur optik trier medien, speziell kolloidaler metallésungen. Ann. 
Phys. 330, 377-445 (1908). 

44. Johnson, P.B.& Christy, R. W. Optical constants of the noble metals. Phys. Rev. B 6, 
4370-4379 (1972). 

45. Kreibig, U. & Fragstein, C. V. The limitation of electron mean free path in small silver 

particles. Z. Phys. 224, 307-323 (1969). 

46. Alvarez, M. M. et al. Optical absorption spectra of nanocrystal gold molecules. 

J. Phys. Chem. B 101, 3706-37 12 (1997). 

47. Hovel, H., Fritz, S., Hilger, A. Kreibig, U. & Vollmer, M. Width of cluster plasmon 

resonances: bulk dielectric functions and chemical interface damping. Phys. Rev. B 

48, 18178-18188 (1993). 

48. Kraus, W. A. & Schatz, G. C. Plasmon resonance broadening in small metal 

particles. J. Chem. Phys. 79, 6130-6139 (1983). 

49. Palik, E. D. Handbook of Optical Constants of Solids (Elsevier, 1998). 

50. Luther, J. M., Jain, P. K., Ewers, T. & Alivisatos, A. P. Localized surface plasmon 
resonances arising from free carriers in doped quantum dots. Nature Mater. 10, 
361-366 (2011). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank S. Sheikholeslami, A. Atre, A. Garcia-Etxarri and A. Baldi 
for discussions. This research was supported by the National Science Foundation 
Graduate Research Fellowship Program. J.A.D. acknowledges support from a Stanford 
Terman Fellowship and a Robert N. Noyce Family Faculty Fellowship. 


Author Contributions J.A.S. performed the experiment, analysed the data, and 
developed the model. A.L.K. provided substantial assistance with the STEM EELS 
procedure. J.A.D. guided and supervised the experiments and analysis. All authors 
contributed to writing and editing the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of this article at 
www.nature.com/nature. Correspondence and requests for materials should be 
addressed to J.A.D. (jdionne@stanford.edu) or J.A.S. (jscholl@stanford.edu). 


22 MARCH 2012 | VOL 483 | NATURE | 427 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


METHODS 

Chemicals and synthesis. Ligand-free silver nanospheres were synthesized accord- 
ing to a previously described technique”’. 5 ml of 1 mM AgNO; (>99% Alfa Aesar) 
were added drop-wise over 3 min to 15 ml of 2mM NaBH, (Fisher Scientific) that 
had been chilled in an ice bath. Both solutions were prepared with Milli-Q deionized 
water (18 MQcm~'). The sodium borohydride solution was stirred vigorously 
during the silver addition to ensure a homogeneous distribution. The metallic 
particles remained separated owing to the charge on the surrounding BH, ions. 
Electron energy-loss spectroscopy. The Ag nanospheres were dispersed on 
ultrathin (3 nm) carbon films (Ted Pella, Inc.) for optimal TEM imaging and minimal 
signal from the substrate. To control for any carbon background effects in the EELS 
data, the nanoparticles were also examined on 8-nm-thick SiO) substrates (Ted Pella, 
Inc.). Silver spheres on both support films exhibited nearly identical blueshifts with 
decreasing particle dimensions (see Supplementary Information). 

A FEI Titan 80-300 environmental (scanning) transmission electron micro- 
scope was employed for particle imaging and EELS collection at 300 kV. EELS 
experiments were performed in scanning, monochromated mode, allowing an 
imaging resolution of 0.27 nm and a EELS zero-loss peak (ZLP) full-width at half- 
maximum of 0.15 eV. The Gatan Tridiem 866 spectrometer achieved a spectral 
dispersion of 0.01 eV per channel. A 50 jtm C3 aperture, camera length of 48 mm, 
and a spectrometer entrance aperture of 2.5mm were used during the data 
acquisition. This corresponds to a convergence semi-angle of 9.2 mrad and a 
collection semi-angle of 16.8 mrad. 


Spectra were collected by directing the beam onto the edge and centre of the 
particle, with ten acquisitions of 30 ms each to prevent saturation of the charge- 
coupled device (CCD) detector. These collections were aligned according to their 
ZLP and summed in post-processing. To reduce the background from the dominant 
ZLP and improve the energy resolution, the Richardson-Lucy deconvolution algo- 
rithm was employed. The point spread function, which indicates the energy spread 
of the electron source and aberrations in the spectrometer and CCD, was estimated 
by the acquisition of ten aligned spectra from the nearby substrate alone. The 
deconvolution algorithm was iterated five times to maximize peak visibility while 
minimizing the introduction of artefacts. 

A ZLP subtraction can also be used to reveal the plasmon resonance and was 
here accomplished through two different techniques. In the first, ‘reflected tail’ 
method, the negative tail of the ZLP was reflected about the 0-eV axis and sub- 
tracted from the positive side*'. The second method involved fitting the ZLP to 
the sum of a Gaussian and Lorentzian curve and subtracting the fit from the 
spectra. We separately applied each ZLP-removal method on the collected EELS 
data, with all methods yielding consistent results in the plasmonic spectra. 

Confidence intervals for resonance peak energies were generated using a boot- 
strapping method in which a locally weighted scatterplot smoothing (LOWESS) 
algorithm was applied to the summed energy loss spectra of each particle. Each 
curve fitting relies on a randomly selected subset of the spectral data, producing 
slight variations in the calculated peak energy. Using 20 iterations of the 
algorithm, 95% confidence intervals could be generated. 
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an influenza histone mimic 
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Viral infection is commonly associated with virus-driven hijacking of host proteins. Here we describe a novel 
mechanism by which influenza virus affects host cells through the interaction of influenza non-structural protein 1 
(NS1) with the infected cell epigenome. We show that the NSI protein of influenza A H3N2 subtype possesses a 
histone-like sequence (histone mimic) that is used by the virus to target the human PAF! transcription elongation 
complex (hPAF1C). We demonstrate that binding of NS1 to hPAFIC depends on the NS1 histone mimic and results in 
suppression of hPAF1C- mediated transcriptional elongation. Furthermore, human PAF1 has a crucial role in the antiviral 
response. Loss of hPAFIC binding by NS1 attenuates influenza infection, whereas hPAFIC deficiency reduces antiviral 
gene expression and renders cells more susceptible to viruses. We propose that the histone mimic in NS1 enables the 
influenza virus to affect inducible gene expression selectively, thus contributing to suppression of the antiviral response. 


Histones are essential regulators of genome function in eukaryotic cells’. 
The amino-terminal domain of histones (the histone tail) provides a 
scaffold for the assembly of protein complexes controlling gene activity’. 
This role of a histone tail is achieved largely by post-translational histone 
modifications, catalysed by distinct modifying enzymes*. The combina- 
tion of diversely modified histone tails establishes gene-region-specific 
patterns that, upon recognition by histone-binding proteins, contribute 
to the regulation of gene replication’, repair® and transcription’. 

The ability of histone tails to guide gene function indicates the 
possibility of targeted control of gene expression by artificial or 
naturally occurring molecules that can structurally and/or functionally 
mimic the histone tail. The former possibility has been demonstrated 
by studies that show the ability of synthetic compounds to interfere 
with inducible gene expression by abolishing the interaction between 
the acetylated histone H4 and the BET family of transcriptional 
regulators*”. Histone binding to transcriptional regulators could also 
be interrupted by exogenous cell-permeable histone peptides'*"’. This 
approach underscores the possibility of a competition between 
endogenous and exogenous histone tails for the common binding 
partners. Indeed, the histone H3 tail-like sequence (histone mimic) 
within histone methyltransferase G9a can compete, in a modification- 
dependent fashion, for binding to the histone-bound heterochromatin 
protein 1 (HP1)'*. Overall, the 3-5-amino-acid-long sequences that 
match various parts of the histone tail can be found in numerous 
eukaryotic and prokaryotic proteins; however, the role of these 
presumptive histone mimics in the regulation of gene activity remains 
unknown. 


Influenza virus NSI carries a histone H3-like sequence 

Pathogens have a known ability to interfere with vital processes in the host 
cells by mimicking regulatory components of host protein networks’. In 
searching for naturally occurring histone mimics with a gene regulatory 
capability, we screened in silico for pathogen-derived proteins that have 


a known or predicted capacity to accumulate within the nuclei of 
infected cells, and bear a histone-tail-like unstructured domain at the 
amino or carboxy termini. One of these proteins was found to be 
the influenza A virus NS1 protein that suppresses host response to 
influenza virus”. 

Depending on the viral subtype, NS1 is a 219-237-amino-acid-long 
protein'*. NS1 is not essential for formation of the viral particle but is 
critical for counteracting the antiviral cell response'*”’. In the absence 
of NS1, influenza is greatly attenuated’*. Suppression of antiviral host 
response by NS1 relies partly on the ability of NS1 to interfere with 
cytosolic signalling processes that regulate the expression of type I 
interferon genes'*’. Additionally, the NS1 protein can affect host 
gene expression by interfering with RNA splicing and messenger 
RNA export”. 

We found that NS1 of influenza A (H3N2) virus carries a sequence that 
resembles the histone H3 tail. Similar to the histone tail, the C terminus of 
NS1 comprises a non-structured and potentially highly interactive 
domain**”° (Fig. 1a). Although this unstructured C-terminal domain is 
present in most of the human influenza A variants (Fig. 1a), only NS1 
from H3N2 subtype (hereafter defined as NS1) possesses the ARSK 
sequence (amino acids 226-229) that is chemically analogous to the 
"ARTK® sequence that comprises the lysine 4 site (H3K4) of histone 
H3 (Fig. 1a). The similarity between histone H3 and NS1 tails is further 
strengthened by the ability of the NS1 tail to serve as a substrate for 
histone-modifying enzymes in vitro. Incubation of the NS1 tail peptide 
with lysine methyltransferase Set1 complex or Set7/9 in the presence of 
S-adenosylmethionine resulted in incorporation of the methyl mark 
into the peptide (Fig. 1b, upper panel). Accordingly, substitution of 
lysine 229 to arginine (K229R) prevented methylation. These results, as 
well as the ability of histone acetyltransferase TIP60—which acetylates 
histone H3 lysine 4 in yeast*°—to acetylate the NS1 peptide at K229 
(Fig. 1b, upper panel), support the histone mimicry by NS1. The ability 
of the H3K4-like sequence within the NS1 tail to serve as a substrate 
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Figure 1 | Influenza NS1 contains a histone mimic. a, The homologous 
carboxy-terminal NS1 and the amino-terminal histone H3 sequences are 
shown (red letters). The table displays C-terminal NS1 sequences of the 
influenza A subtypes. b, Methylation or acetylation of the NS1 peptide (top 
panel), the GST-NS1 protein (middle panel) or of the viral NS1 in A549 
infected cells (bottom panel) are shown. KR, NS1 substrates where K229 is 
replaced by arginine. c, Association of the NS1 histone mimic with the hPAF1C 


for chosen histone-modifying enzymes has been further confirmed 
by using recombinant GST fusion NS1 protein (Fig. 1b, middle panel). 
Finally, the H3K4-like site within NS1 could be methylated or 
acetylated in virally infected cells. Using a custom-made antibody 
against dimethylated lysine 229 in NS1 (NSlme2; Supplemen- 
tary Fig. 1), or acetyl-specific antibody”’, we found the presence of 
methylated and acetylated NS1 in infected human cells (Fig. 1b, lower 
panel). Point mutation of the NS1 gene that causes K229R substitution 
prevented modification (Fig. 1b, lower panel). It is likely that members 
of the H3K4 methyltransferase and acetyltransferase enzymes, that can 
modify NS1 in vitro, are responsible for NS1 modification in vivo. 
However, in view of the known redundancy between multiple H3K4 
modifying complexes***”°, identification of a sole NS1 modifying 
enzyme seems unlikely. 


NS1 interacts with transcript-elongating proteins 


Our experiments show the presence of a histone mimic within the NS1 
tail. In histones, the unmodified or modified histone tail contributes to 
the assembly of chromatin protein complexes**~’. The nature of the 
histone-tail-binding proteins is commonly revealed by the identifica- 
tion of nuclear proteins that bind to immobilized histone peptides 
during affinity purification®™*. Using this approach (Supplementary 
Fig. 2), we searched for human nuclear proteins that bind to the NS1 
tail peptide (amino acids 220-230). To control for the specificity of 
binding we used a scrambled peptide with an amino acid composition 
identical to NS1. Mass-spectrometry analysis of NS1-tail-bound 
proteins revealed the association of the NS1 tail with polypeptides that 
belong to the human PAF1 transcription elongation (hPAF1C) and 
CHD1 chromatin remodelling complexes (Supplementary Fig. 2). 
Binding of the NS1 tail to human PAF1, parafibromin (CDC73), 
SNF2l and CHD1 were validated by western blotting (Fig. 1c). In 


LEO1 
PAF1 


CDC73 
RTF1 


subunits and CHD1 in nuclear extracts. In, input material. d, NS1 histone 
mimic binds to CHD1. Unmodified or methylated NS1 (K229) or methylated 
H3 (K4) peptides were incubated with the recombinant CHD1 or the CHD1 
double-chromodomain. Binding to NS1 was revealed by silver or Coomassie 
staining (top and bottom panel, respectively). e, Binding of Flag-tagged 
hPAFIC subunits to NS1 or histone H3 peptides was assessed by western 
blotting (left and right panels, respectively). IP, immunoprecipitation. 


support of the histone mimicry by the NS1 tail, we observed the 
histone-H3-like pattern of the NS1 tail binding to CHD1 in vitro. 
The chromodomain of human CHD1 binds specifically to di- or 
trimethylated lysine 4 of histone H3 (ref. 35). Similarly, the full-length 
CHD1 or the purified double-chromodomain binds selectively to di- or 
trimethylated NS1 peptide, but not to unmodified or acetylated NS1 
tail (Fig. 1d). 

To start assessing the role of the NS1 histone mimic in the regu- 
lation of chromatin complexes, we focused our attention on hPAFIC. 
We chose this on the basis of several factors. First, hPAF1C is a potent 
regulator of RNA elongation”’, a co-transcriptional process that con- 
tributes significantly to pathogen-induced gene expression’’’’. 
Second, PAFIC has been implicated in the regulation of stress-induced 
genes in yeast** that could be functionally equated to pathogen- 
induced genes in mammalian cells. Third, opposite to CHD 1 that binds 
to the NS1 tail in a methyl-dependent fashion, purified hPAF1C binds 
to unmodified as well as modified NS1 tail (Supplementary Fig. 3a, left 
panel), thus making this interaction potentially more versatile. The 
interaction of hPAF1C with the NS1 tail is sequence-dependent. 
Neither NS1 tail of the H5N1 influenza virus, the scrambled NS1 
peptide, nor the truncated NS1 tail that lacks the H3K4-like sequence 
were able to bind to hPAF1C (Supplementary Fig. 3b). 

hPAF1C consists of six distinct subunits (PAF1, LEO1, CDC73 (also 
called parafibromin), SKI8 (also called WDR61), CTR9, RTF1) that 
have different roles in the interaction between hPAF1C and its binding 
partners, including RNA polymerase II (RNA Pol II)’*”?®. To identify 
the primary NS1-tail-interacting hPAF1C subunit(s), we tested 
binding of the NS1 tail to purified recombinant individual hPAF1C 
subunits. We found that human PAF! is the primary binder of the 
unmethylated or methylated NS1 tail, but not the control peptide that 
lacks the histone mimic (Fig. le, left panel). PAF1 does not bind to 
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acetylated NS1 tail, thus pointing to a possible in vivo regulation of 
NS1 binding to PAF1 by a methyl/acetyl-switch mechanism. Finally, 
NS1 interacts with PAF1 in vivo. Immunoprecipitation of NS1 or 
PAF1 from infected cell extract revealed the existence of an NS1- 
PAF1 complex (Supplementary Fig. 3c). Our findings show that direct 
binding of the NS1 histone mimic to PAF1 is responsible for the 
association of NS1 with hPAFIC. 

Interaction of hPAFIC with the NS1 histone mimic raised the 
possibility of hPAF1C binding to the histone H3 tail. Although the 
interaction of hPAF1C with RNA Pol II and other transcriptional 
regulators is well established***’”, the direct association between 
hPAFIC and histone H3 has not been documented. We found that 
unmodified and methylated histone H3 tail peptides, but not the 
scrambled control peptide, bind to hPAF1C (Supplementary Fig. 3a, 
right panel) or to purified PAF1 protein (Fig. le, right panel). Similar to 
the NS1 histone mimic, the acetylated histone H3 tail did not bind to 
either hPAF1C components. 

The ability of homologous tails of NS1 and histone H3 to interact 
directly with hPAF1C indicated a possibility of virus-mediated hijack- 
ing of the transcription elongation machinery from the host genes. We 
found that NS1 accumulates in the nuclei of the infected human cells 
(Supplementary Fig. 4a), where it reaches approximately 5 X 10° 
molecules per nucleus within 12h after infection (Supplementary 
Fig. 4b). The salt-extraction profile of NS1 from nuclei of infected cells 
showed association of NS1 with chromatin (Supplementary Fig. 4c). 
To assess the position of NS1 on chromatin during viral infection, we 
generated a recombinant virus (Flag-NS1) that expresses Flag-tagged 
NS1 protein (Supplementary Fig. 5a, b). Genome-wide analysis of the 
Flag-NS1 binding at 12 h after infection revealed the presence of NS1 
on induced antiviral gene promoters (Supplementary Tables 1 and 2). 
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Figure 2 | Functional interaction between NS1 and PAF! in infected cells. 
a, The ChIP-seq profiles show the distribution of indicated proteins at inducible 
genes before (black line) and after (red line) infection. The induced genes were 
revealed by RNA-seq and ChIP-seq analysis of infected A549 cells 
(Supplementary Tables 1 and 2). TSS and TES, the transcriptional start and end 
sites, respectively. b, The NS1 levels at gene promoters in PAF1- or CHD1- 
deficient cells (blue and green lines, respectively). The scrambled siRNA- 
treated cells (red line) were used as control. The insert shows knockdown of 
PAF1 or CHD1 in A549 cells. c, PAF1, RNA Pol II and H3K4me3 levels at the 
TSS and TES of the induced genes in uninfected (ui) cells, cells infected with the 
wild-type (WT) or PAF1-binding mutant virus (APAF). Data are 
representative of three independent experiments; error bars show the s.e.m. 
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In infected cells, NS1 is enriched at promoters characterized by the 
presence of H3K4me3 and RNA Pol II (Fig. 2a). The pattern of NS1 
distribution on its targets parallels the distribution of hPAF1C 
(Fig. 2a). Collectively, these data revealed a strategic position of NS1 
at sites of active antiviral gene transcription. 


NSI1 histone mimic controls antiviral gene expression 
The interaction between the NS1 tail and hPAFIC has two comple- 
mentary functions that may support viral infection. The virus probably 
uses the NS1-hPAF1C interaction to target sites of active transcription. 
The short interfering RNA (siRNA)-mediated PAF1 deficiency 
resulted in 30% reduction of NS1 levels at the transcriptional start sites 
of the NS1-bound genes in infected cells (Fig. 2b). The specific role of 
PAF in recruitment of NS1 was supported by unaltered NS1 binding 
to its gene targets in cells treated with CHD1-specific siRNA (Fig. 2b). 
These findings revealed a previously unknown ability of the influenza 
virus to use host PAF1 to position viral NS1 protein at sites of active 
gene transcription. 

Once targeted to transcriptionally active loci, as defined by RNA 
Pol II levels cross-referenced to the infected cell transcriptome (Sup- 
plementary Tables 1 and 2), NS1 interferes with PAF1 and RNA Pol II 
abundance at target genes. As exemplified by virus-induced IFIT1 and 
IFI6 genes, infection with the wild-type influenza virus resulted in 
decreased PAF1 and RNA Pol II levels at the transcriptional end sites 
(TES) and to a lesser extent at the transcriptional start sites (TSS) of 
the genes, as compared with PAF1-binding mutant virus (APAF) 
(Fig. 2c). This pattern of PAF1, RNA Pol II and H3K4me3 distri- 
bution is indicative of impaired transcriptional elongation. 

To determine the impact of the NS1 tail on the dynamics of antiviral 
gene transcription, we conducted a genome-wide nuclear ‘run-on’ 
analysis (global run-on sequencing, GRO-seq) of cells infected with 
the wild-type or mutant viruses that cannot bind to human PAF1 
(APAF). In the absence of infection, we observed accumulation of 
RNA species that map to the TSS of antiviral genes and low levels of 
RNAs that map to the gene body (Fig. 3a). Accumulation of TSS- 
associated short RNAs is a common feature of rapidly inducible genes 
that maintain a paused/poised state in the absence of an inducer*’. 
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Figure 3 | NS1 suppresses antiviral gene transcription in infected cells. 

a, Left: the GRO-seq profile of inducible RNA transcripts in uninfected (ui) 
A549 cells (black line) or cells infected with wild-type or APAF virus (green and 
red lines, respectively). Right: GRO-seq profile of IFIT1 and IFI6 genes in 
uninfected and infected cells. b, GRO-seq profile of A549-expressed genes that 
are not affected by virus infection (left panel) or of the HPRT1 gene (right 
panel). Reads from either DNA strands are indicated as +/— . The y axes 
display reads per million mapped reads per 25 bp. 
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Infection with the APAF virus resulted in a marked increase in levels of 
RNA transcripts that map to the TSS as well as to the downstream gene 
region. The high abundance of transcripts associated within the gene 
body is indicative of effective RNA elongation, which has an essential 
role in regulation of the pathogen-driven host response’’”’. Compared 
to cells infected with APAF virus, the cells infected with wild-type virus 
displayed slightly reduced levels of TSS-associated RNAs but greatly 
diminished levels of transcripts associated with the gene body and 3’ 
end region (Fig. 3a). This pattern is indicative of impaired transcrip- 
tional elongation and is consistent with defective elongation in other 
experimental settings***’. The differences between the impact of APAF 
and wild-type viruses on the global dynamics of antiviral gene tran- 
scription are exemplified by differences in transcription of selected 
antiviral genes (Fig. 3a, right panel). Infection with either wild-type 
or APAF virus has no effect on transcriptional dynamics of housekeep- 
ing genes and genes not affected by viral infection (Fig. 3b). 

The ability of NS1 to control directly hPAF1C-dependent tran- 
scriptional elongation was further supported by the analysis of the 
impact of the NS1 tail on hPAF1C-driven RNA elongation in vitro. In 
this assay, a chromatinized DNA template, supplemented with selected 
transcription factors and histone-modifying enzymes, supports 
hPAF1C-mediated transcription in a fashion that faithfully reproduces 
transcriptional elongation in vivo’* (Fig. 4a). We found that NS1 has a 
negative impact on hPAFIC function (Fig. 4b). The comparison 
between the amount of in vitro synthesized RNA in the absence or 
presence of hPAF1C shows the ability of hPAF1C to boost RNA syn- 
thesis (Fig. 4b). Addition of purified NS1 (Supplementary Fig. 5c) to 
the transcription reaction reduced the amount of synthesized RNA to 
background levels (Fig. 4b). Addition of purified NS1 that lacks the 
PAF1-binding sequence had no effect on RNA synthesis (Fig. 4b). Both 
wild-type NS1 and NS1 that lacks the PAF] binding sequence have no 
impact on transcription of non-chromatinized template in vitro 
(Supplementary Fig. 6). This selective and direct impact of NS1 on 
virus-induced gene expression has been further supported by a lack of 
NS1 requirement for the association between the viral and the host 
RNA polymerases in infected cells***° (Supplementary Fig. 7). 

Our findings provide unambiguous proof for the ability of the NS1 
tail to interfere directly with hPAF1C-driven transcription of antiviral 
genes. 


hPAFIC controls antiviral gene expression and infection 
The significance of NSI-hPAFIC interaction during infection was 
evaluated by two independent experimental approaches. Deletion of 
the hPAF1C binding sequence in NS1 resulted in reduced viral titres 
of the APAF virus as compared to those of the virus carrying full- 
length NS1 (Supplementary Fig. 8). This finding is in agreement with 
the previously reported positive role of the NS1 carboxy terminus in 
influenza A virulence”. 

We also revealed the central role of hPAFIC in transcriptional 
response induced by various viruses, as well as by the viral RNA 
analogue poly(I:C) and type I interferon. To study the impact of 
PAF1 on influenza-induced antiviral gene expression we used the 
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Figure 4 | NS1 inhibits transcriptional elongation in vitro. a, The full-length 
NS1 protein (NS1) or NS1 lacking the PAF1-binding sequence (NS1(APAF)) 
(Supplementary Fig. 5c) was added to the RNA elongation reaction as 
indicated. b, The amount of the 390-nt RNA elongation product was quantified 
by ImageJ. The results of two independent experiments are shown. 
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NS1-deficient influenza virus (PR8/ANS1) that elicits an extremely 
potent antiviral response and is therefore particularly useful to study 
host factors that regulate the antiviral response’*”*. Infection with PR8/ 
ANS1 of control siRNA-treated cells resulted in a strong upregulation 
of numerous antiviral genes (Fig. 5a). This response was reduced in 
PAF 1-deficient cells (Fig. 5a, Supplementary Fig. 9, left panel, and 
Supplementary Table 3). The negative effect of PAF1 deficiency on 
antiviral response is not a consequence of impaired virus life-cycle 
caused by a reduction in viral gene expression (Supplementary Fig. 10). 

PAF1 deficiency also attenuated the transcriptional response to 
wild-type influenza A H1N1 (Supplementary Fig. 11, left panel). The 
general positive role of PAF1 in antiviral gene expression was under- 
scored by reduced expression of antiviral genes in PAF1-deficient cells 
infected with vesicular stomatitis virus (VSV) as well as cells treated 
with poly(I:C) (Supplementary Fig. 11 and Supplementary Tables 
4-6). PAF] deficiency also downregulates the transcriptional response 
to type I interferon that governs the antiviral immunity (Fig. 5b, Sup- 
plementary Fig. 9, right panel and Supplementary Table 7). The 
impaired antiviral response by PAF1-deficient cells contributes to a 
nearly tenfold increase of NS1-deficient PR8/ANS1 virus replication, 
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Figure 5 | PAF1 controls antiviral response. a, b, The expression levels of 
mRNAs in influenza infected (a) or IFN-f1-treated (b) control (siCtrl) or 
PAF 1-deficient (siPAF) A549 cells. The tables show the siPAF-affected gene 
categories. Ut, untreated with siRNA. c, Dynamics of virus replication in 
control or PAF 1-deficient A549 cells. p.fu. plaque-forming units. Data are 
representative of three independent experiments. Error bars show the s.e.m. 
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as compared to infected control cells (Fig. 5c). The impact of PAF1 
deficiency on antiviral gene expression is selective. PAF1 deficiency 
does not alter the expression of housekeeping genes (Supplementary 
Fig. 12 and Supplementary Table 8). 


Concluding remarks 


We have shown that H3N2 influenza A virus interferes with host gene 
expression by exploiting the very basic principles of the epigenetic 
control of gene regulation. By mimicking the histone H3K4 sequence, 
which has a key role in positive regulation of gene transcription, the 
influenza virus gains access to histone-interacting transcriptional 
regulators that govern inducible antiviral gene expression. The 
histone-like unstructured NS1 tails are present in the majority of 
human influenza A isolates, but their primary sequence differs 
between viruses. The presence of potentially reactive amino acid 
residues within the unstructured NS1 tails and the nuclear localiza- 
tion of NS1 predict a multitude of nuclear proteins interacting with 
influenza NS1 in a sequence- and/or modification-specific fashion. 
In extension of the histone code that has provided a conceptual 
framework for understanding chromatin-based gene regulation’, 
we propose the existence of the ‘NS1 code’ that guides influenza virus 
interference with host chromatin. The interference with chromatin 
function may complement other immunosuppressive functions of 
NS1 and provide the influenza virus with an opportunity to affect 
host gene expression in a highly selective fashion by recognizing and 
using epigenetic patterns of the host cells. This mechanism may con- 
tribute to the influenza-variant-specific pathogenesis and disease pro- 
gression that differ between viruses carrying the histone-like NS1 tail 
and those viruses, such as human H1N1 (including pandemic 2009 
virus), that lack the NS1 tail. It is possible that by targeting the 
actively transcribed antiviral genes, the NS1 tail helps influenza to 
balance its virulence against the preservation of the infected cell func- 
tion. Such a mechanism could help the virus to maintain its long-term 
presence within the human population. Our findings could also be 
applied to the development of novel anti-influenza therapies that will 
interfere in a highly selective fashion with the NS1 tail binding to its 
chromatin partners. Finally, identification ofhPAF1C asa key mediator 
of antiviral and pro-inflammatory gene response may pave the way for 
the development of anti-inflammatory therapies that—similar to the 
recently described ‘epigenetic’ therapies*’—will interfere selectively 
with hPAFIC binding to its partners. 


METHODS SUMMARY 


The methylation and acetylation in vitro has been performed as described previ- 
ously'*”*, The analysis of NS1 methylation or acetylation in vivo has been done by 
using custom-made methyl-specific or commercial acetyl-specific antibodies”. 
Isolation of the NS1 binding proteins, their analysis by mass-spectrometry and 
western blotting were performed according to a protocol used to identify histone- 
bound proteins™. The recombinant hPAF1C and its individual components were 
prepared as described before**. Interaction between hPAFIC and its individual 
components with peptides were performed as reported***’. The transcriptional 
assays have been performed as described**. Genome-wide distribution of NS1 and 
other chromatin-bound proteins were accessed by ChIP-sequencing as previ- 
ously described’, and analysed using standard bioinformatic approaches. 
Recombinant influenza viruses were generated using standard reverse genetics 
approaches’*””. Infection of cells with viruses was performed according to the 
standard methods’’* and viral titres were quantified by plaque assay. Gene 
expression analysis has been done by microarray using Illumina Human HT-12 
v4 Expression BeadChips or by RNA-sequencing and verified by quantitative real- 
time PCR’. GRO-sequencing was performed as described*'. All methodological 
details are additionally outlined in Supplementary Information. 
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Intrinsic coupling of lagging-strand 
synthesis to chromatin assembly 


Duncan J. Smith! & lestyn Whitehouse? 


Fifty per cent of the genome is discontinuously replicated on the lagging strand as Okazaki fragments. Eukaryotic 
Okazaki fragments remain poorly characterized and, because nucleosomes are rapidly deposited on nascent DNA, 
Okazaki fragment processing and nucleosome assembly potentially affect one another. Here we show that 
ligation-competent Okazaki fragments in Saccharomyces cerevisiae are sized according to the nucleosome repeat. 
Using deep sequencing, we demonstrate that ligation junctions preferentially occur near nucleosome midpoints 
rather than in internucleosomal linker regions. Disrupting chromatin assembly or lagging-strand polymerase 
processivity affects both the size and the distribution of Okazaki fragments, suggesting a role for nascent chromatin, 
assembled immediately after the passage of the replication fork, in the termination of Okazaki fragment synthesis. Our 
studies represent the first high-resolution analysis—to our knowledge—of eukaryotic Okazaki fragments in vivo, and 
reveal the interconnection between lagging-strand synthesis and chromatin assembly. 


During eukaryotic chromosome replication, both genetic and epigenetic 
information must be accurately duplicated. For chromatin architecture 
and modifications to be truly epigenetic—that is, heritable despite not 
being genetically encoded—complete disruption and dissociation of 
nucleosomes from the replication fork must be prevented to allow the 
rapid re-deposition of precisely located and appropriately modified 
histones on the nascent DNA. Histone chaperone complexes govern 
both nucleosome disassembly and assembly at the replication fork’. 
DNA replication is inherently asymmetric. Okazaki fragment syn- 
thesis on the lagging strand necessitates the repeated production of 
single-stranded DNA and polymerization in the opposite direction to 
fork progression. Given the delay in lagging-strand synthesis and the 
rapidity of histone deposition behind the replication fork’, these two 
processes may be interlinked. A coordinated series of events occurs each 
time an Okazaki fragment is synthesized’. Each fragment is initiated via 
an RNA primer* and up to 30 nucleotides of DNA, both synthesized by 
DNA polymerase « (Pol «)-primase*; subsequently, the clamp loading 
factor RFC mediates the replacement of Pol « by the sliding clamp 
PCNA and the processive polymerase Pol 6 (ref. 6). Pol 6 extends the 
nascent DNA chain through the 5’ end of the preceding Okazaki frag- 
ment”*; this strand-displacement synthesis generates a 5’ RNA or DNA 
flap that is cleaved by nucleases such as the flap endonuclease Fen1 
(ref. 9). Repeated cycles of extension and DNA cleavage followed by 
Pol idling’® produce a nick that migrates away from the replication fork 
and can be sealed by DNA ligase I'’. Little to no DNA synthesized by the 
error-prone Pol % remains in the genome after replication”, suggesting 
that strand displacement generally replaces at least 30 nucleotides of 
DNA, but it remains unclear how the replication machinery coordinates 
the transition from Pol 5 to DNA ligase. Indeed, despite their fun- 
damental importance in replication, comparatively little is known about 
the in vivo properties of eukaryotic Okazaki fragments: although they 
are widely accepted to be shorter than the 1-2 kilobases (kb) observed in 
prokaryotes, a clear consensus has not emerged'*'*'°. Moreover, we 
have little information about how nucleosome assembly may interact 
with lagging-strand synthesis. A plausible explanation for the difference 
in size between eukaryotic and prokaryotic Okazaki fragments is that 
eukaryotic DNA replication occurs within the context of chromatin. 


To investigate the relationship between lagging-strand processing 
and chromatin assembly, we have developed strategies to analyse 
Okazaki fragments purified from S. cerevisiae. We find that Okazaki 
fragments are heterogeneously sized with a repeat length correspond- 
ing to that of nucleosomes. In addition, we find that ligation junctions 
occur preferentially around known nucleosome midpoints, and that 
these relationships can be altered by interfering with chromatin 
assembly or Pol 6 processivity. Our data suggest an integrated model 
whereby nascent chromatin structure facilitates Okazaki fragment 
processing, which may provide a means to monitor nucleosome 
assembly behind the replication fork. 


Nucleosome-sized Okazaki fragments 

To enrich for ligation-competent Okazaki fragments, we constructed 
strains of S. cerevisiae in which DNA ligase I'' (CDC9) expression is 
driven from a doxycycline-repressible promoter and the encoded protein 
tagged with a temperature-sensitive degron’®. The inability to ligate 
Okazaki fragments should result in the accumulation of nicked DNA 
that, after purification, can be radiolabelled using DNA polymerase and 
o-°*P dCTP. Ligase inactivation in an asynchronous culture resulted in 
the accumulation of short heterogeneous DNA species whose abund- 
ance increased with duration of ligase inhibition (Fig. 1a). Notably, 
these fragments displayed an underlying periodicity reminiscent of 
the nucleosome repeat. Indeed, comparison of the nicked DNA to a 
nucleosome ladder indicated that a significant fraction of fragments 
seem to be sized according to the 165-bp nucleosomal repeat length 
in S. cerevisiae (Fig. 1b). 

To clarify that we were observing Okazaki fragments, we confirmed 
that they did not appear in a Gl-arrested culture until S-phase release 
(Fig. 1c). Additionally, we treated purified DNA with recombinant 
DNA ligase; this repaired the nicks and severely diminished our ability 
to label fragments (Fig. 1d), showing that our assay predominantly 
detects nicked DNA rather than species containing single-stranded 
gaps or flaps resulting from incomplete Okazaki fragment processing. 
Unrelated control experiments indicated that a 125-nucleotide species 
previously reported to be Okazaki fragments'*”” is 5S ribosomal RNA 
(Supplementary Fig. 2). Thus, we demonstrate that the mono-, di- and 
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Figure 1 | DNA ligase I depletion in S. cerevisiae leads to the accumulation 
of Okazaki fragments sized similarly to the nucleosome repeat. 

a, Transcriptional repression of DNA ligase I (CDC9) results in the accumulation 
of nicked DNA. Cells carrying a doxycycline-repressible allele of the CDC9 gene 
were treated with doxycycline (Dox) for the indicated time. Purified genomic 
DNA was labelled using exonuclease-deficient Klenow fragment and o-°”P 
dCTP and separated in a denaturing agarose gel. nt, nucleotides. b, The size of 
labelled Okazaki fragments mirrors the nucleosome repeat. Okazaki fragments 
(lane 2) were labelled as in Fig. 1a; nucleosomes were prepared from wild-type 
(lane 3) or repressible CDC9 strains (lanes 4 and 5) by in vivo micrococcal 
nuclease (MNase) digestion. The chromatin digestion patterns in lanes 4 and 5 
indicate that CDC9 repression does not alter global chromatin structure. 

c, Okazaki fragments accumulate during S phase. Cells were arrested in G1 using 
a-factor, during which time CDC9 transcription was inhibited by the addition of 
doxycycline, and degradation of the protein stimulated by activation of the 
degron system using galactose and 37 °C. Okazaki fragments appear upon 
release of the culture into S phase (lanes 3 and 4). d, Okazaki fragments are 
bordered by ligatable nicks. Purified DNA was treated (lanes 2 and 4) or mock- 
treated (lanes 1 and 3) with the indicated ligase, and then labelled as in Fig. 1a. 
The inability to label fragments after ligase treatment confirms that labelled 
Okazaki fragments are flanked by ligatable nicks. 


trinucleosome-sized fragments observed on ligase inhibition are 
generated in S phase and bordered by ligatable nicks—the essential 
properties of ligation-competent Okazaki fragments. 


Global distribution of Okazaki fragments 

To explore further the relationship between Okazaki fragments and 
nucleosomes, we developed a deep sequencing approach to map the 
strand, position and abundance of fragments across the S. cerevisiae 
genome. To allow yeast to complete S phase in the presence of nicked 
DNA we inactivated the DNA damage checkpoint by deleting the 
RAD9 gene. Additionally, to deplete ligase activity further we deleted 
the second DNA ligase (DNL4). DNL4 or RAD9 deletion does not 
affect Okazaki fragment size (Supplementary Fig. 3). Okazaki frag- 
ments were harvested from an asynchronous culture after a ~2.5h 
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ligase inactivation. Small single-stranded fragments were purified by 
anion exchange chromatography in alkaline conditions and sequen- 
cing primers were ligated directly to each end’* (Supplementary Fig. 4). 
Importantly, this method preserves strand identity. 

After paired-end deep sequencing, we aligned the data to the 
S. cerevisiae genome and found that use of a single asynchronous 
culture was sufficient to attain complete coverage (a representative 
chromosome is shown in Fig. 2a, and a comparison of replicates in 
Supplementary Fig. 5). Sequencing reads aligning to the Watson or 
Crick strands showed a complementary distribution with strand bias 
being particularly pronounced close to replication origins’; such 
asymmetry is expected, given that replication forks proceed bidirec- 
tionally from origins (Fig. 2b). The strong strand bias observed around 
experimentally validated replication origins mapped at high resolution 
(Fig. 2c)” demonstrates preferential sequencing of nascent lagging 
strands, unequivocally confirming that the DNA species enriched after 
ligase repression are Okazaki fragments. Detailed analysis of global 
Okazaki fragment distributions can provide mechanistic insight into 
replication-fork dynamics (S. McGuffee, D.J.S. and LW., manuscript in 
preparation). 


Okazaki fragments terminate within nucleosomes 

The periodic size of Okazaki fragments observed in Fig. 1 led us to 
investigate how the ends of Okazaki fragments relate to nucleosome 
positions found within the yeast genome. We aligned Okazaki frag- 
ment termini to a reference list of all consensus nucleosome midpoints 
(dyads) in S. cerevisiae’. Inherent bias towards smaller fragments 
during purification, library amplification and sequencing result in size 
distributions that differ substantially between starting material and 
sequenced fragments (Supplementary Fig. 6); therefore, we randomly 
selected fragments from our sequencing data to approximate the 
mononucleosome-sized fragments we detected by electrophoresis 
(Supplementary Fig. 6). Aligning both the 5’ and 3’ ends of mature 
(Fig. 3a) mononuclesome-sized fragments against the dyads of 
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Figure 2 | Sequenced Okazaki fragments show a pronounced bias towards 
the lagging strand. a, Distribution of paired-end sequencing hits mapping to 
either the Watson (blue, above the axis) or Crick (orange, below) strands across 
S. cerevisiae chromosome 10. The locations of replication origins’ are indicated 
as grey dashed lines. Data are unsmoothed. b, The anticipated distribution of 
Okazaki fragments surrounding an efficient origin. c, Sequenced Okazaki 
fragments are strongly enriched in regions predicted to be replicated as the 
lagging strand. Log, ratio of Watson strand: Crick strand fragments across a 50- 
kb window around ARS consensus sequences (ACS) confirmed previously” to 
correspond to active origins. Data are smoothed to 200 bp. 
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Figure 3 | Okazaki fragment termini are preferentially located at 
nucleosome dyads. a, Schematic indicating the mature Okazaki fragment 
termini sequenced in this study. b, Okazaki fragment termini are enriched 
around nucleosome dyads. The distribution of termini from mononucleosome- 
sized Okazaki fragments around the top 50% consensus S. cerevisiae 
nucleosome dyad locations”' is shown. Unless otherwise indicated, all analyses 
use unsmoothed data normalized to the maximum signal in the analysed range, 
and area aligned such that Okazaki fragment (OF) synthesis by Pol 5 proceeds 
from left to right. c, The distribution of Okazaki fragment termini correlates 
with nucleosome occupancy” around TSSs. Data are aligned to TSSs such that 
the direction of transcription is from left to right: only Okazaki fragments 
synthesized in the same direction as transcription are analysed here; equivalent 
analyses for fragments synthesized in the opposite direction can be found in 
Supplementary Fig. 7. Data are smoothed to 5 bp. d, Okazaki fragments at 
nucleosome-depleted regions are disproportionately likely to be long. Equal 
numbers of reads were selected from the top (long), and the bottom three 
(short) length quartiles of sequenced fragments. The log, ratio of long to short 
fragments is plotted. Data are smoothed to 5 bp. 


high-confidence nucleosomes indicated that the highest density of 
ends occurs at the dyad (Fig. 3b). Alignment of unfiltered data and 
dinucleosome-sized fragments is comparable to the mononucleosome 
size-selected subset (Supplementary Fig. 6). Therefore, rather than 
being overrepresented in the accessible internucleosomal linker 
DNA, we find that Okazaki fragment termination is most likely to 
occur at locations corresponding to nucleosome dyads. 


DNA-bound proteins dissociate Pol 5 


To confirm that the observed correspondence between nucleosome 
occupancy and Okazaki fragment end density was not biased bya small 
subset of nucleosomes, we analysed transcription start sites (TSSs), at 
which nucleosome positioning follows a well-defined pattern’. 
Alignment of Okazaki fragment termini around ~4,500 TSSs indi- 
cated a marked correlation between nucleosome occupancy and the 
density of Okazaki fragment ends (Fig. 3c and Supplementary Fig. 7) 
across a broad region surrounding TSSs. Precise nucleosome position- 
ing around TSSs is determined by chromatin remodelling”’”’, allowing 
us to infer that the correlation refers to remodelled, rather than intrins- 
ically preferred, nucleosome locations. 

If the ends of Okazaki fragments are dictated by nucleosomes then 
nicks should always be in register with positioned nucleosomes, 
regardless of internucleosomal spacing. Thus, nucleosomes separated 
by long linker DNA should be associated with longer Okazaki frag- 
ments. We took advantage of the differential nucleosome spacing 
surrounding the nucleosome-depleted region (NDR) at TSSs, com- 
paring the distribution of equal numbers of randomly selected long 
and short fragments (defined, respectively, as the top quartile of 
sequenced fragments by length, and the bottom three quartiles). 
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When the ratio of long to short fragments is plotted across this region 
(Fig. 3d and Supplementary Fig. 7), we find that fragments whose ends 
map within nucleosomes bordering the NDR are disproportionately 
long. Thus, Okazaki fragment length reflects nucleosome spacing. 

As well as correlating with nucleosome occupancy, Okazaki frag- 
ment termini were enriched around known transcription factor bind- 
ing sites (Fig. 3c)**”*. This suggested that, in addition to nucleosomes, 
sequence-specific DNA binding factors might directly influence the 
ends of Okazaki fragments. Such correlations would be expected if, 
following replication-fork passage, some transcription factors rapidly 
bound to replicated DNA and impeded Pol 6 during strand-displacement 
synthesis, causing the polymerase to dissociate and leave a nick close to 
the site of collision. Furthermore, this model would also provide an 
explanation for the observed link between nucleosome occupancy 
and Okazaki fragment termination: nucleosomes contain numerous 
weak protein-DNA interactions, whose cumulative strength peaks 
around the dyad’*. Polymerases can invade nucleosomes, but experi- 
ence increasing resistance as they approach the dyad*””*. Therefore, if 
nucleosomes are rapidly re-established on the lagging strand behind the 
replication fork, and Pol 6 invades them during strand-displacement 
synthesis, the likelihood of polymerase stalling and/or dissociation will 
increase with proximity to the dyad (see model in Supplementary Fig. 1), 
producing Okazaki fragments sized according to the nucleosome 
repeat. Other models that invoke biased Okazaki fragment initiation 
due to replisome pausing on encountering nucleosomes would pro- 
duce distinct correlations around NDRs (Supplementary Fig. 8). 

If polymerization by Pol 6 is inhibited by sequence-specific DNA- 
bound transcription factors, Okazaki fragment ends should accumulate 
on the side of the factor that is first encountered by Pol 6. Aligning all 
transcription factor binding sites** indicated a significant enrichment of 
precisely juxtaposed Okazaki fragment 5’ and 3’ termini around the 
replication-fork-proximal side of the binding site (Fig. 4a). However, 
when the Abf1, Reb1 and Rap1 binding sites were considered separately 
we found that, among transcription factors, these proteins most 
strongly biased the positioning of Okazaki fragment ends (Fig. 4b, c 
and Supplementary Fig. 9). Abfl, Reb1 and Rap1 have essential roles in 
chromatin organization’, and may therefore bind more rapidly and 
tightly to newly replicated DNA than the majority of transcription 
factors. We observed a strand-dependent enrichment of Okazaki frag- 
ment termini around the 3’ ends of tRNA genes, as well as transcription 
factor TFIIIB binding sites (Supplementary Fig. 10). Thus, it seems that 
diverse DNA binding proteins can influence the processing of Okazaki 
fragments and probably facilitate the dissociation of Pol 4, with tightly 
bound protein complexes stimulating termination more precisely than 
those, such as nascent histones, that have more diffuse binding regions. 


Chromatin assembly dictates nick location 
Two testable predictions arise from the model that Pol 6 dissociates 
via interaction with nascent nucleosomes: (1) disrupting nucleosome 
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Figure 4 | Transcription factors with roles in nucleosome positioning 
stimulate dissociation of Pol 6. a, Okazaki fragment termini are enriched on 
the replication-fork-proximal side of known transcription factor (TF) binding 
sites”. Mid, midpoint of TF binding site. b, c, The enrichment of Okazaki 
fragment termini around transcription factor binding sites can be attributed 
almost entirely to three factors—Abf1, Reb1 and Rap1—known to have roles in 
nucleosome positioning. 
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assembly will impair nucleosome-mediated Okazaki fragment ter- 
mination and should, therefore, alter the size distribution of the frag- 
ments; and (2) reducing Pol 6 processivity will result in a shift of 
Okazaki fragment termination sites away from the dyad, towards 
the replication-fork-proximal side of nucleosomes. 

The deposition of (H3H4), tetramers on nascent DNA—the first 
stage in nucleosome assembly—is mediated in part by CAF-1, a multi- 
subunit histone chaperone complex associated with the replisome via 
PCNA”. We deleted each subunit of CAF-1 individually, and com- 
pared the length of Okazaki fragments in the deletion strains to those 
with wild-type CAF-1 (Fig. 5a). Loss of any CAF-1 subunit abrogated 
the periodic sizing of Okazaki fragments, significantly increasing their 
average length. These data are consistent with a global delay in 
(H3H4), deposition following replication, with the resulting paucity 
of tetrasomes/nucleosomes on the lagging strand leading to increased 
strand displacement by Pol 6 and thus to longer Okazaki fragments. 

In S. cerevisiae, Pol 6 is a heterotrimer whose Pol32 subunit 
increases processivity and PCNA binding affinity’'; Pol 6 lacking 
Pol32 shows decreased strand-displacement synthesis in vitro and 
probably in vivo’. Labelled Okazaki fragments from a pol32A strain 
were more heterogeneous but were still generally sized according to 
the nucleosome repeat (Fig. 5b). When the positions of the 5’ and 3’ 
termini of mononucleosome-sized fragments (165 + 15 nucleotides, 
as per Fig. 3b) from the pol32A strain were aligned against nucleosome 
midpoints, a clear shift was observed towards the first edge of the 
nucleosome encountered by the polymerase (Fig. 5c and Supplemen- 
tary Fig. 11). Therefore, when Pol 3 processivity is perturbed, the 
polymerase dissociates more rapidly due to an inability to invade nascent 
nucleosomes assembled on Okazaki fragments. The new maximum end 
density occurs ~35-40 nucleotides from the dyad, consistent with the 
extent of DNA protected by (H3H4), tetramers**: nucleosome assembly 
occurs step-wise, with assembly of (H3H4), tetramers preceding 
recruitment of H2A/H2B”™; it is therefore possible that the histone- 
DNA species encountered by Pol 6 is a tetrasome rather than a nucleo- 
some, although our data do not allow us to distinguish between these 
possibilities. Importantly, the distribution of Okazaki fragment termini 
around transcription factor binding sites was identical between wild- 
type and po/32A strains (Supplementary Fig. 9), allowing us to con- 
clude that the shift towards the edge of nucleosomes does not simply 
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Figure 5 | Impaired chromatin assembly and Pol 5 processivity affect the 
size of Okazaki fragments and the location of their termini, respectively. 
a, Deletion of any component of the CAF-1 complex (lanes 2-4) results in 
Okazaki fragments that are no longer sized according to nucleosomes. WT, 
wild type. b, Deletion of the pol32 subunit of Pol 6 (lane 2) does not abrogate the 
chromatin-like size distribution of Okazaki fragments. c, In the absence of 
Pol32, the termini of mononucleosome-sized Okazaki fragments are shifted 
towards the replication-fork-proximal edge of the nucleosome: peak density 
occurs at a location consistent with the predicted edges of (H3H4), tetramers. 
See also Supplementary Fig. 11. 
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represent a constant decrease of ~35-40 bp in the extent of strand 
displacement. 


Discussion 


Our data suggest a mechanism by which Pol 4 may reliably remove DNA 
synthesized by Pol « while avoiding excessive strand-displacement syn- 
thesis. Repeated cycles of extension and cleavage can occur during the 
synthesis of a single Okazaki fragment. After removal of the RNA 
primer, all DNA flap structures generated by Pol 6 are biochemically 
indistinguishable from one another, and an external mechanism is 
therefore required to measure the extent of strand displacement already 
carried out by Pol 6 on each fragment. The removal of Pol 6 by newly 
deposited histones represents a simple way to constrain Pol d extension, 
and might allow nucleosome assembly to stimulate replication-fork 
progression directly. Although our data imply that nucleosomes 
strongly impede Pol 6, we note that both Fenl and DNA ligase I can 
act efficiently on nucleosomal substrates***°. We speculate that carrying 
out Okazaki fragment processing in the context of nucleosomes rather 
than DNA allows the ligation reaction to outcompete further strand 
displacement by Pol 6. 

CAF-1 is not essential in S. cerevisiae, suggesting that other 
histone chaperones such as Asfl, Rttl106 and HIR” allow sufficient 
replication-coupled or post-replicative nucleosome assembly for viab- 
ility. Nevertheless, our results are consistent with a temporal delay in 
nucleosome assembly in CAF-1 mutants, which may give rise to previ- 
ously reported silencing defects**. The importance of replication- 
coupled chromatin assembly in metazoa is illustrated by the severe 
phenotypes of CAF-1 or ASF1 disruption in human cell lines: ASF1 
depletion leads to replication-fork stalling’, and CAF-1 depletion also 
precludes progression through S phase”. 

Our studies demonstrate that the location of Okazaki fragment 
termini can be determined by interactions between the lagging-strand 
polymerase and nascent nucleosomes. This observation provides the 
first direct mechanistic evidence for the coupling of DNA replication 
to chromatin assembly on the newly replicated daughter genomes. 
Such coupling is fundamentally important given the part played by 
chromatin in the regulation of gene expression, as well as the potential 
for the epigenetic inheritance of precisely located modified nucleo- 
somes. In addition, transcription factors known to have roles in the 
establishment of chromatin structure are apparently able to re-bind to 
DNA immediately after replication-fork passage. Unlike histones, 
which are present in sufficient number to be distributed to both 
daughter genomes, DNA-bound transcription factors are present at 
only one local copy per two daughters. Regulation of transcription 
factor re-binding could thus have a key role in asymmetric epigenetic 
inheritance. 


METHODS SUMMARY 


Genomic DNA was prepared from spheroblasts essentially as described for medium 
resolution DSB mapping", labelled using exo-Klenow fragment (NEB) and «-*?P 
dCTP and separated in 1.3% denaturing agarose gels. 

Okazaki fragments were purified by binding heat-denatured genomic DNA to 
Source 15Q (GE Healthcare) at pH 12, 300 mM NaCl, and eluting in 50 mM steps 
to 1 M. Primers for paired-end Illumina TruSeq sequencing were directly ligated 
to each end. Following second-strand synthesis and size selection by gel purifica- 
tion, libraries were amplified by PCR, purified from two sequential 2.5% native 
agarose gels, and sequenced directly. Full details are in Supplementary Fig. 4 and 
Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


DNA purification. Yeast strains carrying degron-tagged, doxycycline-repressible 
alleles of CDC9 and a galactose-inducible UBR1 allele (see Supplementary Table 1 
for a list of strains) were grown at 30 °C in YEP supplemented with 2% raffinose. 
At optical density (OD) 0.4, galactose and doxycycline were added to final con- 
centrations of 2% and 40 mg1™ y respectively, and the culture shaken at 37 °C for 
2.5h. Fifty-millilitre cultures were used for labelling experiments, and 250-ml 
cultures for purification and library generation. 

Genomic DNA was prepared from spheroblasts as described for medium 

resolution DSB mapping”. Following ligase repression, cells were collected by cent- 
rifugation, washed in SCE buffer (1M sorbitol, 100 mM sodium citrate, 60 mM 
EDTA, pH 7.0) and spheroblasted for 3 min with 5 mg zymolyase 20T (USB) per 
50-ml culture. Spheroblasts were washed with SCE, and resuspended in 480 ul lysis 
buffer (50mM Tris-HCl, pH 8.0, 50mM EDTA, 100mM NaCl, 1.5% sarkosyl) 
containing 150 1g proteinase K (Fisher). Digestion was carried out for 2-16h at 
37 °C. After digestion, residual proteins and peptides were precipitated by adding 
200 pl 5M KOAc and spinning at 16,000g for 30 min at 4°C. Nucleic acids were 
precipitated from the supernatant by addition of 500 pl isopropanol and centrifu- 
gation at 16,000g for 10 min. Pellets were washed twice with 500 pl 70% ethanol, 
resuspended in 200 tl STE buffer (10 mM Tris-HCl, pH 8.0, 1 mM EDTA, 100 mM 
NaCl) and digested with 25 ug RNase A (Sigma) and/or 10 U RIboshredder RNase 
blend (Epicentre) at 37 °C for 30 min. Genomic DNA was precipitated by addition 
of 20 pl NaO Ac, pH 5.5 and 800 ul ethanol followed by centrifugation at 5,000g for 
10 min at room temperature (25 °C). Pellets were washed with 70% ethanol and 
resuspended in 1 tl TE (10 mM Tris:Cl pH 7.5, 0.1mM EDTA) per ml original 
culture volume. DNA was stored at 4 °C and never frozen. 
DNA labelling. Two microlitres of DNA (corresponding to the genomic DNA 
content of 2 ml cultured cells) was used in 20 il labelling reactions containing 5 U 
Klenow (exo-)polymerase (NEB) and o-dCTP (Perkin Elmer) at a final concen- 
tration of 33 nM. Free label was removed using Illustra microspin G-50 columns 
(GE healthcare). Labelled DNA was separated in 1.3% denaturing agarose gels 
(50 mM NaOH, 1 mM EDTA). After electrophoresis, the gel was neutralized and 
DNA transferred to an uncharged nitrocellulose membrane (Hybond-N; GE 
healthcare) via capillary transfer. Membranes were exposed to phosphor screens 
or film. 
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Okazaki fragment purification. Genomic DNA purified as described earlier was 
denatured by heating to 95°C for 5 min, rapidly cooled on ice and brought to 
300 mM NaCl, pH 12. Purification was carried out in batch using 400 pl Source 
15Q (GE healthcare), binding at 300 mM NaCl, pH 12 and eluting in 50 mM steps 
to 1M NaCl, pH 12. As determined by purification of fragments pre-labelled as 
above (see also Supplementary Fig. 4), fractions from 800-900 mM NaCl con- 
tained the majority of fragments of interest. DNA was ethanol precipitated and 
treated with 10 U Riboshredder RNase blend for 30 min at 37 °C to remove residual, 
undigested RNA: digestion products were removed using Illustra microspin G-50 
columns to leave essentially pure Okazaki fragments. 

Sequencing library generation. Adaptor primer pairs with single-stranded 
overhangs (shown schematically in Supplementary Fig. 4) were annealed by cooling 
from 95 °C and purified from 12% native polyacrylamide gels via standard methods. 
Sequences of the adaptor pairs are as follows. 5’ top, ACACTCTTTCCCTACACG 
ACGCTCTTCCGATCT; 5’ bottom, NNNNNNAGATCGGAAGAGCGTCGTGT 
AGGGAAAGAGTGT; 3’ top, /Phos/AGATCGGAAGAGCGGTTCAGCAGGAA 
TGCCGAG; 3’ bottom, CTCGGCATTCCTGCTGAACCGCTCTTCCGATCTN 
NNNNN. 

Up to 200 ng denatured purified Okazaki fragments were incubated at 16 °C 
overnight in a ligation reaction containing 1 jig of each primer pair and 1,000 U 
T4 DNA ligase (NEB). Unligated adaptors were removed using Illustra microspin 
S-300 columns (GE healthcare) and a second strand-synthesis reaction carried 
out at 72°C using Taq polymerase (NEB). Products from ~200-1,000 bp were 
purified from 2.5% agarose gels run in TBE using Qiaquick kits (Qiagen). Purified 
libraries were amplified (16 cycles) using Illumina Truseq primers according to 
Illumina protocols, except that KOD hot start polymerase (Novagen) was used. 
Amplified libraries were purified from two sequential 2.5% agarose gels. 
Subsequent steps in the sequencing workflow were carried out according to 
standard procedures. 

Nucleosome dyad and occupancy data. The top 50% of nucleosome dyads (by 
confidence score), taken from a meta-analysis”, were used in our analysis; nucleosome 
occupancy data (Fig. 3b) were from the YPD data set detailed previously. 


42. Kaplan, N. et al. The DNA-encoded nucleosome organization of a eukaryotic 
genome. Nature 458, 362-366 (2009). 
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Resonant quantum transitions in trapped 


antihydrogen atoms 


C. Amole', M. D. Ashkezari?, M. Baquero-Ruiz*, W. Bertsche**®, P. D. Bowe’, E. Butler®, A. Capra’, C. L. Cesar’, M. Charlton‘, 
A. Deller*, P. H. Donnan”, S. Eriksson‘, J. Fajans*", T. Friesen””, M. C. Fujiwara!” 3, D. R. Gill’, A. Gutierrez!*, J. S. Hangst’, 
W. N. Hardy'*"’, M. E. Hayden’, A. J. Humphries’, C. A. Isaac’, S. Jonsell'®, L. Kurchaninov’’, A. Little’, N. Madsen‘, 

J.T. K. McKenna’, S. Menary/, S. C. Napoli’, P. Nolan’, K. Olchanski'’, A. Olin'®, P. Pusa!’”, C. @. Rasmussen’, F. Robicheaux!®, 
E. Sarid!”, C. R. Shields*, D. M. Silveira7°y, S. Stracka!?, C. So’, R. I. Thompson”, D. P. van der Werf* & J. S. Wurtele*?! 


The hydrogen atom is one of the most important and influential 
model systems in modern physics. Attempts to understand its 
spectrum are inextricably linked to the early history and develop- 
ment of quantum mechanics. The hydrogen atom’s stature lies in 
its simplicity and in the accuracy with which its spectrum can be 
measured’ and compared to theory. Today its spectrum remains a 
valuable tool for determining the values of fundamental constants 
and for challenging the limits of modern physics, including the 
validity of quantum electrodynamics and—by comparison with 
measurements on its antimatter counterpart, antihydrogen—the 
validity of CPT (charge conjugation, parity and time reversal) 
symmetry. Here we report spectroscopy of a pure antimatter atom, 
demonstrating resonant quantum transitions in antihydrogen. We 
have manipulated the internal spin state** of antihydrogen atoms 
so as to induce magnetic resonance transitions between hyperfine 
levels of the positronic ground state. We used resonant microwave 
radiation to flip the spin of the positron in antihydrogen atoms 
that were magnetically trapped** in the ALPHA apparatus. The 
spin flip causes trapped anti-atoms to be ejected from the trap. We 
look for evidence of resonant interaction by comparing the survival 
rate of trapped atoms irradiated with microwaves on-resonance to 
that of atoms subjected to microwaves that are off-resonance. In 
one variant of the experiment, we detect 23 atoms that survive in 
110 trapping attempts with microwaves off-resonance (0.21 per 
attempt), and only two atoms that survive in 103 attempts with 
microwaves on-resonance (0.02 per attempt). We also describe the 
direct detection of the annihilation of antihydrogen atoms ejected 
by the microwaves. 

Magnetostatic trapping of neutral atoms’ or anti-atoms is accomp- 
lished by creating a local minimum of the magnetic field magnitude in 
free space. The confining force results from interaction of the atomic 
magnetic moment pt with the non-uniform magnetic field. Figure 1 shows 
the expected Breit-Rabi hyperfine level diagram for the ground state of 
the antihydrogen atom in a magnetic field. We label the four eigenstates 
|a), |b), |c) and |d) in order of increasing energy. Trapping is possible 
when the atom is in a ‘low-field seeking’ quantum state (|c) or |d) in 
Fig. 1). We employ the Ioffe-Pritchard’ configuration: the superposition 
of a magnetic multipole (an octupole) field that confines atoms in the 
transverse directions and two ‘mirror coil’ fields for axial confinement’. 

Working at the Antiproton Decelerator’ facility at CERN, we 
recently demonstrated magnetic confinement of cold antihydrogen 


atoms* and showed that—once trapped—these atoms end up in their 
ground state, where they can be held’ for up to 1,000 s. Here we use the 
same apparatus, modified to enable injection of microwaves into the 
trapping volume (Fig. 2a). Antihydrogen atoms are produced near the 
field minimum (about 1T, Fig. 2b) by mixing cold plasmas of 
antiprotons and positrons for about 1s (Methods). Atoms having 
kinetic energies corresponding to less than 0.5K can be trapped. 
Mixing about two million positrons and 20,000 antiprotons yields 
approximately 6,000 anti-atoms; on average, approximately one 
anti-atom is trapped. The trapping field currents can be ramped down 
with a time constant of 9 ms, releasing trapped atoms in a well-defined 
time window*. The trapping volume is surrounded by a three-layer, 
30,720-channel imaging silicon detector’®, which can locate the spatial 
positions—vertices—of antiproton annihilations. 

Our approach was to subject trapped antihydrogen atoms to res- 
onant microwaves to eject them from the trap. A tuned, oscillating 
magnetic field B, applied perpendicularly to the trapping field can 
drive positron spin-flip transitions between the trappable and the 
untrappable states, that is, |c)—>|b) and |d)—>|a). Untrapped atoms 
escape and annihilate on the surrounding apparatus. A single experi- 
mental cycle or ‘trapping attempt’ involves producing anti-atoms in 
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Figure 1 | Hyperfine energy levels. The Breit-Rabi diagram, showing the 
relative hyperfine energy levels of the ground state of the hydrogen (and 
antihydrogen, assuming CPT invariance) atom in a magnetic field. In the state 
vectors shown (for the high-field limit), the single arrow refers to the positron 
spin and the double arrow refers to the antiproton spin. 
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Figure 2 | The ALPHA device. a, Cut-away, schematic drawing of the 
antihydrogen synthesis and trapping region of the ALPHA apparatus. The 
superconducting atom-trap magnets, the annihilation detector, and some of the 
Penning trap electrodes are shown. An external solenoid (not shown) provides 
a 1-T magnetic field for the Penning trap. The drawing is not to scale. The inner 
diameter of the Penning trap electrodes is 44.5 mm and the minimum-B trap 
has an effective length of 274 mm. Microwaves are injected along the axis of the 
trapping volume using a horn antenna, which is located about 130 cm from the 
trap axial midpoint. b, Map of magnetic field strength in the ALPHA 
antihydrogen trap. The red contour bounds a region up to 0.35 mT (or 10 MHz 
in microwave frequency equivalent) above the minimum field, to roughly 
indicate the size of the resonant volume. 


the magnetic trap, holding any trapped atoms first for 60s (during 
which the magnetic field may be changed) and then for 180s (during 
which microwaves may be introduced), and then intentionally releas- 
ing any remaining atoms to detect their annihilation. 

To select the proper microwave frequencies and magnetic field 
configurations, we consider (Fig. 3a) the calculated positron spin res- 
onance lineshapes for equal numbers of trapped |c) and |d) state atoms 
exposed to microwaves. The abrupt low-frequency onsets are asso- 
ciated with the minimum in the static magnetic field near the trap 
centre; the high-frequency tails reflect the highly inhomogeneous 
nature of the trapping fields elsewhere. We choose the resonance 
condition such that transitions are induced as atoms pass close to 
the magnetic minimum (Fig. 2b). This choice yields higher transition 
rates than elsewhere in the trap, and it localizes the position in space 
where transitions occur, and whence the resulting high-field seeking 
atoms are ejected. We do not knowa priori the hyperfine level in which 
atoms are trapped; for a given magnetic field configuration we need to 
alternately irradiate the trap at two frequencies (details in Fig. 3 legend) 
separated by the zero-field hyperfine splitting of 1,420.4 MHz. 

We collect two distinct, complementary types of annihilation data. 
First, at the end of every trapping attempt, we rapidly turn off the 
confining fields and watch for annihilation events in a 30-ms window 
from the start of the magnet shutdown. This allows us to determine an 
effective trapping rate. The ejection of trapped atoms by resonant 
microwaves will reduce this rate. Rates for application of resonant 
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microwaves, off-resonant microwaves and no microwaves can be com- 
pared: these are ‘disappearance mode’ data. The second data set comes 
from monitoring annihilation events throughout the entire time that 
antihydrogen atoms are held in the trap. We look for events from 
ejected atoms during the time that resonant microwave fields are 
applied: these are ‘appearance mode’ data. 

Because the duration of the observation window differs significantly 
between these two modes, we rely on two different cosmic background 
rejection algorithms (Methods). In the 30-ms ‘disappearance window’ 
we use the algorithm developed earlier* (the ‘default criteria’). The 
rate at which cosmic ray events are interpreted as annihilations by this 
selection scheme is (4.7+0.2)X10 7s '. For the much longer 
‘appearance mode’ observation (180s), we rely on an alternative set 
of acceptance criteria that, compared to the default criteria, reduces 
annihilations by 25% but lowers the cosmic background by an order of 
magnitude. To avoid experimenter bias, the two sets of criteria are 
optimized and cross-checked using control samples*®: cosmic ray 
events and annihilation events collected independently of the trapping 
experiments described here. 

We conducted six series of measurements. For series 1, we set the 
minimum on-axis trapping field Bs to some value B“ (Methods), and 
then applied resonant microwave fields at frequencies fi and fi 
(Fig. 3b; see the legend for microwave frequency drive details) during 
the 180-s hold portion of the cycle. For series 2, we shifted B*'S to 
B® > B® by increasing the mirror coil currents such that microwave 
fields oscillating at f;s and f4 are detuned by 100 MHz and are no 
longer resonant with atoms at the centre of the trap (Fig. 3c). The field 
was shifted (in about 1 s) after the mixing and initial trapping phase of 
each trapping attempt, and a waiting period of 59s was imposed to 
allow the field to stabilize before microwave introduction. This con- 
figuration should eliminate |d) > |a) transitions and reduce the rate of 
|c) |b) transitions. The latter can still occur as atoms pass through 
regions of space in which the local magnetic field brings them into 
resonance with microwave fields applied at frequency f‘\. Series 3 
involved operating at the higher field B® and shifting the microwave 
frequencies so as to bring both transitions back into resonance 
(Fig. 3d). Series 4 field and frequency conditions were identical to those 
of series 2, but series 4 attempts were interleaved with those of series 3. 
This repetition attempts to minimize possible systematic effects due to 
time variations in the experimental conditions. Antiproton beam and 
plasma conditions—and thus the initial trapping rate—can vary from 
day to day, so on- and off-resonance experiments were interspersed. 
Thus, series 1 and 2 were taken under very similar conditions, and 
series 3 and 4 constitute a second, complementary set of measure- 
ments. In concert with series 1-4, we measured trapping and annihila- 
tion rates with B* set to B* or B® when no microwaves are injected 
into the apparatus (series 5 and 6, respectively). Apart from changes in 
magnetic field or microwave conditions, the experimental procedure 
was identical for each of the six series. 

A summary of ‘disappearance mode’ data appears in Tables 1 and 2. 
The errors quoted are based on counting statistics only. By comparing 
the Poisson rate of the process of interest with the rate of the control 
process, we evaluate the probability (P-value) that the observed number 
of outcomes, or a more extreme one, could have been produced by 
background fluctuations'!. We observe a clear decrease in the survival 
rate for the cases in which microwaves are injected on-resonance, as 
compared to the equivalent off-resonance measurements, with a 
P-value of 1.0 107°. 

The two measurement sets (series 1-2; series 3-4) could have dif- 
ferent systematic uncertainties. For example, in the former, the mirror 
field shift might affect the orbit dynamics of the trapped antihydrogen 
(a hypothesis not supported by numerical simulations), while the latter 
could suffer from different microwave field characteristics between 
on- and off-resonance frequencies (again, this is not supported by our 
detailed off-line measurements with electron plasmas; see Methods). 
However, both data sets show decreases in the on-resonance rate 
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Figure 3 | Transition lineshapes and resonance conditions. a, Calculated 
spin-flip transition lineshapes in the ALPHA antihydrogen trap. Transition 
probability (arbitrary units) is plotted versus microwave frequency. Only the 
trapping field inhomogeneity is considered in calculating the lineshape. 

b, Schematic representation of the experimental situation for the on-resonance 
experiments at magnetic field B* (series 1). The yellow bands represent the 
frequency ranges over which the microwaves are scanned. c, The situation for 
off-resonance experiments at magnetic field BP (series 2 and 4). d, The situation 


measurements, compared to the associated off-resonance measure- 
ments, with P-values of 1.6 X 10 * and 1.5 X 10°”, respectively, sup- 
porting the hypothesis that the difference is due to spin flip. 

We note that the survival rates for the no-microwave measurements 
are higher than for those in which microwaves are present but off- 
resonance (the P-value is 6 X 10 *). This difference could be explained 
by far off-resonant interactions with the |c) state, assuming there is 
sufficient microwave power to induce spin flips in the long tails of the 
resonance lineshape (Fig. 3c). (See the discussion of the appearance 
mode data below.) 

We also directly searched for annihilation signals of anti-atoms that 
are ejected from the trap after a spin-flip transition—the ‘appearance 
mode’ described above. Figure 4a shows the time history of events 
satisfying the alternative acceptance criteria and having |z|<6cm 
(Methods). In the first frequency sweep (0 <t< 30s) we observe a 
significant excess of counts (P = 2.8 X 10°) in on-resonance (series 1 


Table 1 | Series summaries for the ‘disappearance mode’ analysis 


for on-resonant experiments at magnetic field B® (series 3). A two-segment 
frequency sweep lasting 30s was used to apply microwave fields. This sweep 
was repeated six times in each trapping attempt for a total microwave 
application time of 180s, beginning 60s after the end of antihydrogen 
formation. In each case, the first 15-s scan covers the lower yellow band and the 
second 15-s scan covers the upper yellow band. The bands span —5 MHz to 
+10 MHz about the target frequency. 


plus series 3) compared to off-resonance attempts (series 2 plus series 4). 
Seven of the 19 events appearing in 0 < t< 15s (microwaves probing 
fo) Occur in the first second; for 15 < t< 30s (probing f,4) the first 
second has seven of 18 events. This suggests that the microwave power 
is sufficient to flip most of the spins during the first 30-s sweep, in 
agreement with numerical simulations of the transition rate 
(Methods). An investigation of power dependence indicated that a level 
as low as one-sixteenth of the nominal 700 mW injected (Methods) was 
still enough to eject the trapped atoms in the first 30-s sweep, again 
consistent with the simulations. 

In the off-resonant experiments, we observe a mild excess of counts 
above the no-microwave case (series 5 plus series 6) with an associated 
P=5.6%X10 ~. We interpret this excess to be due to the above- 
mentioned off-resonance interaction with the |c) state. This conclu- 
sion is supported by the fact that the events are in 15<t<30s 
(Fig. 4a), when the microwaves are probing the upper 15-MHz 


Series Relative microwave Relative magnetic field Number of attempts Antihydrogen detected Rate Comment 
frequency at trap shutdown 

1 O MHz OmT (Bs = BY) 79 1 0.01+0.01 On resonance (Fig. 3b) 
2 O MHz +3.5 mT (Bxs = B®) 88 16 0.18 + 0.05 Off resonance (Fig. 3c) 
3 +100 MHz +3.5 mT (Bis = B®) 24 1 0.04 + 0.04 On resonance (Fig. 3d) 
4 O MHz +3.5 mT (Bxis = B®) 22 7 0.32 +0.12 Off resonance (Fig. 3c) 
5 Off OmT (Bis = B’) 52 17 0.33 + 0.08 No microwaves 

6 Off +3.5 mT (Bxis = B®) 48 23 0.48 + 0.10 No microwaves 
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Table 2 | Totals for all ‘disappearance mode’ series 


Number of attempts Detected antihydrogen Rate 
On resonance (1 + 3) 103 2 0.02 + 0.01 
Off resonance (2 + 4) 110 23 0.21 +0.04 
No microwaves (5 + 6) 100 40 0.40 + 0.06 


frequency band (Fig. 3c), and by indications that the nominal power 
should be sufficient to drive off-resonant transitions. Taken together, 
the disappearance and appearance analyses constitute a consistent 
picture of the fate of the trapped antihydrogen atoms. 
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Figure 4 | Appearance mode data. a, The number of ‘appearance mode’ 
annihilation events satisfying the alternative selection criteria and |z|<6cm 
(Methods) as a function of time between the end of antihydrogen production 
and the trap shutdown. Microwave power is first applied at time t = 0. The 
expected cosmic background per bin per run is 0.026 + 0.005 events. The error 
bars are due to counting statistics. b, The z-distribution of annihilation vertices 
in ‘appearance mode’ for 0 < t< 30s. The grey histogram is the result of a 
numerical simulation of the motion of spin-flipped atoms ejected from the trap. 
The dashed black curve is the result of a simulation of trapped antihydrogen 
annihilating on the residual gas (Methods). Both simulations are normalized to 
the on-resonant data. 
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We have considered other processes that could lead to antihydrogen 
loss in the presence of microwaves but that could not be due to a spin- 
flip. The only plausible candidate is heating of the trap electrodes by 
the microwaves, causing desorption from the surfaces of cryo-pumped 
material, which could then scatter or annihilate the trapped anti- 
atoms. Indeed, we observe a slight electrode temperature increase from 
about 8 K to at most 11 K during the 180-s microwave cycle. However, 
any such thermal effect on the vacuum should be the same for series 1 
and 2, which differ only by a slight change in the trapping magnetic 
field. Further evidence against vacuum deterioration comes from 
Fig. 4b, which shows the z-distribution of appearance-type events 
(in 0<t< 30s). The distribution is highly localized around the trap 
centre, as we expect from simulations of how spin-flipped atoms are 
lost from the trap (Methods). Annihilation or collisional loss of 
trapped anti-atoms in a compromised vacuum could occur anywhere 
in the 274-mm-long trapping volume. 

We thus conclude that we have observed resonant interaction of 
microwave radiation with the internal quantum states of trapped 
antihydrogen atoms. This is a proof-of-principle experiment; we have 
not yet attempted to accurately localize a resonance or determine a 
spectroscopic lineshape. We have bounded the resonance between the 
off-resonance scan value and the maximum of the on-resonance 
sweep. Roughly speaking, the observed resonance is within 100 MHz 
of the resonance frequency expected for hydrogen, corresponding to a 
relative precision of about 4 X 10 *. This experiment marks the advent 
of antimatter spectroscopy and takes a preliminary step towards 
precision comparison of the spectra of hydrogen and antihydrogen 
as a test of CPT symmetry. Importantly, it also demonstrates the 
viability of performing fundamental measurements on small numbers 
of trapped anti-atoms by combining resonant interaction with the long 
trapping times and sensitive annihilation detection in ALPHA. In 
future experiments, the transition |c)<>|d) could be probed by double 
resonance; the frequency of this transition goes through a broad 
maximum” at a field of 0.65 T, allowing a precision measurement of 
hyperfine parameters without requiring precise knowledge of the abso- 
lute value of B (see Supplementary Information). 


METHODS SUMMARY 


The ALPHA apparatus traps antihydrogen atoms synthesized from cold plasmas 
of positrons and antiprotons. Microwaves from a frequency synthesizer were 
amplified and injected into the magnetic atom trap using a horn antenna. We 
use electron cyclotron frequency measurement techniques to set the magnetic field 
in the device, and to characterize microwave field patterns. We perform numerical 
simulations of trapped antihydrogen dynamics to model microwave resonant 
lineshapes and transition rates, atom ejection dynamics, and the spatial distri- 
bution of residual gas annihilation. Two distinct analysis methods are used to 
reduce cosmic ray background in the annihilation detector. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Antihydrogen synthesis and trapping. The ALPHA techniques for synthesizing 
trappable anti-atoms are described extensively elsewhere*®. Antihydrogen atoms 
are produced near the trap minimum by mixing cold plasmas of antiprotons and 
positrons for about 1 sina Penning-Malmberg"* trap. The mixing makes use of the 
evaporative cooling™ (for both positrons and antiprotons) and autoresonant injec- 
tion’* techniques developed for our initial demonstration of trapping. At the end of 
the 1-s synthesis stage the magnetic trap fields are on and the trapping region has 
been cleared of any remaining charged particles. The anti-atoms are then held in 
the trap for 240s before being released. During the first 1 s of this time period we 
either ramp the mirror coil currents from 650 A to 692 A (adding 3.5 mT to B® to 
attain B”), or do nothing (to remain at BA), During the next 59 s we wait to ensure 
that the currents in the mirror coils have stabilized. Finally, during the last 180 s we 
either inject microwaves or not, depending on the measurement type. 
Microwave injection. Ka-band microwaves from an Agilent 8257D PSG Signal 
Generator are amplified and injected down the axis of the apparatus via a waveguide- 
fed horn antenna. The maximum power used was about 700 mW, measured at the 
vacuum transition. 

Electron cyclotron resonance diagnostics. We measure the electron cyclotron 
resonance frequency by loading an electron plasma in the centre of the trap. A 
series of 4-11s microwave pulses is injected, at frequencies scanned across the 
cyclotron resonance; these pulses heat the plasma. Between each pulse we allow 
the plasma to return to its equilibrium temperature. Simultaneously, we monitor 
the quadrupole vibrational mode of the plasma by applying an oscillating potential 
at 26.5 MHz to an electrode adjacent to the plasma and measuring the plasma 
response on another. The frequency of this mode shifts approximately linearly 
with changes in temperature’®. When the microwave frequency matches the cyclotron 
frequency, the heating of the plasma and the quadrupole frequency shift will be 
maximized. This method allows us to determine and to set the trapping magnetic 
field and to ensure field stability between trapping attempts. 

Using the quadrupole frequency shift diagnostic, we can also infer the in situ 
amplitude of the microwave electric fields near the trap centre. We adjust the 
solenoid field so that the electron cyclotron resonance frequency is equal to one 
of the spin-flip transition frequencies and inject resonant microwave pulses to heat 
the plasma. From the temperature increase we can infer that the peak electric field 
amplitudes for 700mW injected power are about E(f{)=110Vm', 
E(f§)=150V m *, E(fy.) =130V m+, E(f,) =100 Vm. 

In yet another mode of operation, we fix the microwave frequency and apply an 
axial magnetic field gradient across a long (about 4cm) electron plasma so that 
only a narrow slice of the plasma is in resonance. The external solenoid field is then 
swept through resonance to generate a map of electric field strength along the 
length of the plasma, reflecting the underlying standing-wave pattern. This pro- 
vides another check of the similarity of microwave field distributions at the four 
frequencies, as well as of variations over the 15-MHz frequency sweeps. We see no 
evidence for significant differences in the microwave environment at the two pairs 
of frequencies (fi, f4 and f2, f.,) or within the sweeps. 

Magnetic field settings. The background solenoid field of about 1 T is the same as 
that used previously’. We use the electron cyclotron resonance technique to 
quantify the contributions of the solenoid and trap magnets to B*“S and to deter- 
mine the field change necessary to achieve the 100 MHz offset for off-resonant 
operation. The microwave frequencies used for driving (fi, f‘\) were 
(28.276, 29.696) GHz and for (f,3, f8,) were (28.376, 29.796) GHz. The electron 
cyclotron resonance measurements were used to monitor field stability from 
attempt to attempt; any necessary corrections were done by adjusting the back- 
ground solenoid field. The reproducibility of the field-setting procedure translates 
to +2 MHz in microwave frequency. 

Numerical simulations of antihydrogen dynamics. We use a mixture of 
quantum and classical mechanics to simulate the effect of the microwaves on 
the trapped antihydrogen, and to calculate spatial distributions, both for ejected 
atoms, and for atoms lost by annihilation on the residual gas. The simulated anti- 
atoms’ are in a low-field seeking state and are launched from the region of the 
positron plasma with a 50 K thermal distribution; only those with kinetic energy 
less than about 0.5 K are trapped. The atomic motion is calculated classically using 
a smooth fit to the magnetic field to obtain the centre-of-mass force. 

The spatial structure of the microwave field in the electrode stack is complex, 
but an order-of-magnitude estimate of spin-flip transition rates can be obtained 


by assuming that the microwave magnetic fields B, are those of a plane wave 
propagating in free space. During each simulation time step, we check whether the 
spin-flip resonance condition was met. If it was, we compute the transition 
probability from the standard Landau—Zener approximation for a two state system 
using a three-point time fit to the energy and coupling parameters. In the strong 
trapping field, the coupling matrix element between the states is approximately 
B, 1/4. The resonance condition is met twice each time the atom passes through 
the centre of the trap, and we allow for the unlikely possibility of the spin flipping 
twice. The coupling matrix element V can be related to the Rabi frequency, 
Q=V/h. For a microwave electric field of 100 Vm’ * (giving a vacuum intensity 
of 1.3mWcm ”), B, is about 0.33 1T and the Rabi frequency is approximately 
1.5 X 10*rads° '. Simulations do not lead to a simple exponential decay of trapped 
population when the microwaves are present, because antihydrogen trajectories 
differ in how they pass through the resonance volume. As a rough estimate, a 
microwave intensity of 2mW cm ” gives a flip rate of the order of 1s’. 

Default criteria for annihilation event identification. The detector tracks the 
trajectories of charged pions that are produced when released antihydrogen atoms 
encounter matter in the Penning trap electrodes and annihilate. A reconstruction 
algorithm that considers track topology is then used to discriminate between pion 
tracks and cosmic ray events, and ultimately to locate the spatial position (‘vertex’) 
of each annihilation event'”. The detector and the ‘default criteria’ for the event 
discrimination procedure have been extensively described previously*”. 

Alternative criteria for annihilation event identification. We use a bagged 
decision tree classifier, in the random forest approach'*°, to separate antiproton 
annihilations on the trap walls from cosmic ray events. Nine variables are used for 
classification: (1) the radial and (2) the azimuthal coordinates of the reconstructed 
annihilation vertex, if present, (3) the total number of channels registering ‘hits’ by 
charged particles, (4) the number of three-hit combinations used as track candi- 
dates, (5) the number of reconstructed tracks, (6) the sum of the squared residual 
distances of hits from a fitted straight line, and three topological variables. The 
topological variables comprise (7) a sphericity variable, (8) the cosine of the angle 
between the event axis and the detector axis, and (9) the angle between the 
vertical and the projection of the event axis onto the transverse (x—y) plane. 


3 
The sphericity variable is defined as the quantity 32 +3). Here 4, >A. >23 


N = we 
are the eigenvalues of the tensor SB — yy p%p! |p; | *) i N, where p* is the 
component « (where «=.x,y,z) of the momentum associated with the ith track. 


The event axis is defined as the line passing through the centre of the detector and 
oriented along the eigenvector associated with A. 

The random forest event-selection criteria have been determined by maximiz- 
ing a sensitivity figure of merit*'. Compared to the ‘default’ selection, this method 
is about ten times more effective in rejecting cosmic background, while retaining 
75% of the signal. For Fig. 4a, based on dynamical simulations, we require the 
event’s axial position z to be less than 6cm away from the trap centre. This 
requirement affects the signal only marginally and further suppresses the back- 
ground bya factor of three, resulting in a cosmic rate of (1.7+0.3) x 107° 5 |. For 
Fig. 4b we select an annihilation candidate if it falls within 0 < t< 30s (the first 
microwave sweep). 
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Electron tomography at 2.4-angstrém resolution 
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Transmission electron microscopy is a powerful imaging tool that 
has found broad application in materials science, nanoscience and 
biology’ *. With the introduction of aberration-corrected electron 
lenses, both the spatial resolution and the image quality in trans- 
mission electron microscopy have been significantly improved** and 
resolution below 0.5 angstréms has been demonstrated®. To reveal 
the three-dimensional (3D) structure of thin samples, electron 
tomography is the method of choice”""’, with cubic-nanometre reso- 
lution currently achievable’*"’. Discrete tomography has recently 
been used to generate a 3D atomic reconstruction of a silver nano- 
particle two to three nanometres in diameter’’, but this statistical 
method assumes prior knowledge of the particle’s lattice structure 
and requires that the atoms fit rigidly on that lattice. Here we report 
the experimental demonstration of a general electron tomography 
method that achieves atomic-scale resolution without initial 
assumptions about the sample structure. By combining a novel 
projection alignment and tomographic reconstruction method with 
scanning transmission electron microscopy, we have determined 
the 3D structure of an approximately ten-nanometre gold nano- 
particle at 2.4-angstrém resolution. Although we cannot defini- 
tively locate all of the atoms inside the nanoparticle, individual 
atoms are observed in some regions of the particle and several grains 
are identified in three dimensions. The 3D surface morphology 
and internal lattice structure revealed are consistent with a distorted 
icosahedral multiply twinned particle. We anticipate that this 
general method can be applied not only to determine the 3D struc- 
ture of nanomaterials at atomic-scale resolution’*’’, but also to 
improve the spatial resolution and image quality in other tomo- 
graphy fields””'*”°. 

Since its introduction in 1968, electron tomography has been 
primarily used to determine the 3D structure of biological samples””. 
In the past decade, electron tomography has been increasingly applied 
in materials science and nanoscience through the use of scanning 
transmission electron microscopy (STEM)*’*"". The highest resolu- 
tion at present achieved by STEM tomography is around 1 nm in three 
dimensions'*”’, although slightly higher resolution has been obtained 
in a study of fullerene-like nanostructures with bright-field electron 
tomography”. A general electron tomography method with atomic- 
scale resolution, however, has not been demonstrated for several 
reasons. First, aligning the projections of a tomographic tilt series to 
a common axis with atomic level precision is technically challenging. 
Second, radiation damage limits the number of projections that can be 
acquired from a single object'*’. Last, specimens cannot usually be 
tilted beyond +79°, preventing acquisition of data from the ‘missing 
wedge’ "'. Here we demonstrate that these limitations can be overcome 
or alleviated by applying a novel alignment approach and an iterative 
tomographic reconstruction method to a tilt series obtained via annular 
dark field (ADF)-STEM”*™, 

The conventional alignment approach used in electron tomography 
either relies on fiducial markers such as colloidal gold beads or is based 
on the cross-correlation between neighbouring projections””. To our 


knowledge, neither of these alignment approaches can achieve atomic- 
level precision. To overcome this limitation, we have developed a 
method based on the centre of mass, which is able to align the projec- 
tions ofa tilt series at atomic-level accuracy even with noise (Methods). 
To address the other two difficulties mentioned above, we have imple- 
mented a data acquisition and tomographic reconstruction method, 
termed equally sloped tomography'®'* °°” (EST). Compared to con- 
ventional tomography, which reconstructs a 3D object froma tilt series 
of projections with constant angular increments, EST acquires a tilt 
series with equal slope increments, and then iterates back and forth 
between real and reciprocal space (Methods). In each iteration, 
constraints such as the sample boundary (that is, the support) and 
positivity of the Coulomb potential are applied in real space, while 
the measured projectional slices (that is, the Fourier transform of the 
projections) are enforced in reciprocal space. Each iteration is 
monitored by an error metric, and the algorithm is terminated after 
reaching a maximum number of iterations. 

To test the feasibility of achieving a high-resolution tomographic 
reconstruction by the centre-of-mass and EST methods, we conducted 
numerical simulations on an ~5-nm gold nanoparticle with icosahedral 
symmetry and a total of 3,871 atoms (Supplementary Figs 3a, 4a, c, e). 
A tilt series of 55 projections was obtained from the particle using 
multislice STEM calculations”® (energy, 300 keV; spherical aberration, 
1.2mm; illumination semi-angle, 7.98mrad; defocus, 48.6 nm; 
detector inner and outer angles, 13 and 78 mrad). To minimize 
nonlinear intensity contributions caused by dynamical scattering 
and electron channelling”, projections along zone axis orientations 
were avoided. The tilt angles range from —72.6° to +72.6° with equal 
slope increments. To more closely approximate realistic experimental 
conditions, several additional modifications were made to generate the 
simulated data. First, the tilt angles were continuously shifted from 0° 
to 0.5° over the process of the tilt series and the magnification of the 
images was continuously changed from 0 to 0.2%. Second, each 
projection in the tilt series was arbitrarily shifted along the x and y 
axes, where the electron beam direction is parallel to the z axis. Last, 
Poisson noise was added to the projections of the tilt series with a total 
electron dose of 6.1 X 10°e A’ *. Supplementary Fig. 3 shows a linear 
projection of the model at 0° and the corresponding multislice STEM 
projection. The apparent increase of the atom size in the multislice 
projection was mainly caused by the nonlinear and diffraction effects 
in the nanoparticle. 

The 55 projections were aligned to a common tilt axis with the 
centre-of-mass method, and were then reconstructed with the EST 
algorithm. Supplementary Fig. 4b, d, f shows three 2.5-A-thick central 
slices of the 3D reconstruction in the x-y, z-x and z-y planes. 
Although the missing wedge problem is not completely solved (the 
top and bottom parts in Supplementary Fig. 4f) and the size of the 
reconstructed atoms is increased mainly due to the nonlinear and 
diffraction effects, the atomic positions and grain boundaries in the 
3D reconstruction are consistent with the model. The simulation 
results indicate that the centre-of-mass and EST methods can be used 
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to achieve an atomic-scale resolution reconstruction from a tilt series 
of 55 projections with a missing wedge, nonlinear effects, Poisson noise 
and experimental errors. 

Next, the centre-of-mass and EST methods were applied to experi- 
mental tilt series acquired from gold nanoparticles. Nanoparticles are 
an important class of materials with properties different from either 
molecules or bulk solids'*”*, and nano-gold is among the most widely 
studied of this class of material owing to its broad applications in 
chemistry, biology, materials science, nanoscience and nanotechnology™*. 
In this study, we imaged gold particles with a diameter of ~10nm 
because smaller particles are not as stable under an electron beam’*”’. 
To reduce the electron dose, we used a low-exposure data acquisition 
scheme with a 300-keV ADF-STEM (Methods). When focusing an 
image, a nearby nanoparticle was first viewed (not the particle of 
interest), thus reducing the unnecessary radiation dose to the particle 
under study. Using this scheme, we acquired several tomographic tilt 
series of gold nanoparticles. Supplementary Fig. 5 shows a tilt series of 
69 projections and their Fourier transforms, with a total electron dose 
of ~7.6 X 10°e A *. Supplementary Fig. 6 shows three 0° projections 
and their Fourier transforms measured during the acquisition of this 
tilt series to monitor the effects of radiation damage. Although some 
minor shape changes occurred, the crystal lattice structure of the par- 
ticle remained consistent throughout the experiment. 

To investigate the nonlinear effects in the experiment, we simulated 
an ~10-nm gold particle with icosahedral symmetry and performed 
multislice STEM calculations on an 11.5-A-thick slab of this particle”® 
(Supplementary Fig. 7). Although the atom size was increased, mainly 
owing to the nonlinear diffraction and dynamical scattering effects, the 
multislice STEM projection exhibits a crystal lattice structure consist- 
ent with the model. We then calculated a tilt series for a 2.5-A central 
slice using the multislice simulations. The tilt series consists of 69 
projections with a tilt range of +72.6°. Supplementary Fig. 8 shows 
the model and reconstructed slices. The atomic positions and the 


Figure 1 | Evaluation of the 3D reconstruction quality. a-d, Representative 
measured (a) and calculated (c) projections and their Fourier transforms 
(respectively b, d) at tilt angle 7.1°. Insets in a and c show the projected atomic 
positions inside the blue square in the main panel. The overall shape of the 
nanoparticle and the location of the Bragg peaks agree well, indicating a good 
quality 3D reconstruction. This gold nanoparticle is the same as studied in 
Figs 2-4, 
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internal grain boundaries are resolved, except in a very few places 
owing to the nonlinear effects in the projections. 

After investigating the nonlinear effects, we performed post data 
analysis of the experimental tilt series (Methods), and aligned the pro- 
jections with the centre-of-mass method. To reconstruct the 3D struc- 
ture, we first estimated a loose 3D support, defined to be larger than the 
particle boundary. After 500 iterations of the EST algorithm, the recon- 
struction was used to determine a tight support (that is, close to the true 
boundary of the particle). Using the tight support, we ran another 500 
iterations and obtained a final 3D structure. To examine the quality of 
the reconstruction, we calculated 69 projections from the final 3D 
structure and found the average normalized discrepancy with the 
measured projections to be 6.7% (Supplementary Methods and 
Supplementary Table 1). Representative measured and calculated pro- 
jections at different particle orientations are shown in Fig. 1 and Sup- 
plementary Fig. 9. Although there are some minor differences between 
the two projections, the overall shape and lattice structure agree well. To 
more rigorously examine the accuracy of the reconstruction, an EST 
reconstruction was performed from 68 experimental projections by 
removing the 7.1° projection. The 3D reconstruction was then projected 
back to calculate the projection at 7.1°, which is consistent with the 
experimentally measured one (Supplementary Fig. 10). 

To estimate the resolution achieved in the reconstruction, we chose 
a 3.36-A-thick central slice in the x-y plane. Figure 2a, b shows the 
slice and its Fourier transform, in which the distance between two 
neighbouring atom columns and the location of the Bragg peaks 
indicate that a resolution of 2.4 A was achieved in the x and y direc- 
tions. To estimate the resolution along the z axis (beam direction), we 
selected a 3.36-A-thick slice with the horizontal axis along the z axis 
(Fig. 2c, d). The resolution close to the z axis was determined to be 
2.4 A. Individual atoms are visible in some regions of the slices, but not 
all atoms can be identified in the slices. Supplementary Fig. 11 shows 
two 3.36-A-thick slices of the nanoparticle in a different orientation, 


Figure 2 | Estimation of the 3D resolution of the reconstruction of the gold 
nanoparticle. a, b, A-3.36-A thick central slice in the x-y plane (a) and its 
Fourier transform (b), indicating that a resolution of 2.4 A corresponding to the 
gold (111) lattice was achieved along the x and y axes. c, d, A 3.36-A-thick slice 
in the z-y plane (c) and its Fourier transform (d) where the horizontal axis is 
along the z axis (beam direction). The resolution in the z axis was estimated to 
be 2.4A. Individual atoms are visible in some regions of the slices, but not all 
atoms can be identified in the slices. 
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exhibiting crystal lattice structure that is not present in Fig. 2a and c. 
The apparent flattening of the particle along the beam axis was 
also observed in the 3D reconstructions (Fig. 2 and Supplementary 
Movie 1), and was probably caused by the interaction between the 
nanoparticle and the Si substrate. 

To visualize the internal structure and the morphology of the gold 
nanoparticle, we generated 3D volume and iso-surface renderings of 
the reconstruction, in which both surface and internal lattice structures 
are visible (Supplementary Movies 1 and 2). Figure 3a, b shows volume 
renderings of the nanoparticle and their Fourier transforms (insets) at 
the two- and three-fold symmetry orientations. The corresponding 
iso-surface renderings at the same orientations are shown in Fig. 3c 
and d. The overall 3D shape and facets of the nanoparticle are con- 
sistent with an icosahedron (insets in Fig. 3c and d). To identify 
internal 3D grains, we applied the 3D Fourier transform to the recon- 
struction. By identifying the Bragg peaks of each major grain and 
applying the 3D inverse Fourier transform to the selected Bragg peaks, 
we determined four major 3D grains inside the gold nanoparticle 
(Methods). Figure 4 and Supplementary Movie 3 show a volume 
rendering of the four grains in three dimensions, in which grains 1, 
2 and grains 3, 4 are related by mirror-reflection across the horizontal 
interfaces marked by dotted lines. The angle enclosed by close-packed 
planes across these interfaces was measured to be 69.9° + 0.8° between 
grains 1 and 2, and 71.3° + 0.8° between grains 3 and 4, both of which 
are consistent with the angle for a face-centred-cubic twin boundary 
(70.53°). By applying the same method to some other Bragg peaks, we 
identified 3D grains in the top and bottom parts of the particle 
(Supplementary Fig. 12). The surface morphology and the internal 
lattice structures suggest that this is a distorted icosahedral multiply 
twinned particle, typically found for nano-gold in the size range above 
10 nm (ref. 29). 

By combining the centre-of-mass alignment technique and the EST 
reconstruction method with an ADF-STEM, we have determined the 3D 
structure of a ~10-nm gold nanoparticle at 2.4 A resolution from a tilt 
series of 69 projections with a missing wedge. Several grains are identified 
inside the nanoparticle in three dimensions. Although individual 


Figure 3 | 3D structure of the reconstructed gold nanoparticle. a, b, 3D 
volume renderings of the nanoparticle and their Fourier transforms (insets) at 
the two-fold (a) and three-fold (b) symmetry orientations. c, d, Iso-surface 
renderings of the nanoparticle at the two-fold (c) and three-fold (d) symmetry 
orientations. Insets show a model icosahedron at the same orientations. 
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Figure 4 | Identification of four major grains inside the gold nanoparticle in 
three dimensions. Grains 1, 2 and grains 3, 4 are related by mirror-reflection 
across the horizontal interfaces marked by dotted lines. The angle enclosed by 
close-packed planes across these interfaces was measured to be 69.9° + 0.8° 

between grains 1 and 2, and 71.3° + 0.8° between grains 3 and 4, both of which 
are consistent with the angle for a face-centred cubic twin boundary (70.53°). 


atoms are visible in some regions of the nanoparticle, we cannot defi- 
nitively locate all of the atoms inside the particle. In order to identify all 
the atoms in the particle (estimated to be ~23,800) without using 
atomicity and bond information, a resolution higher than 2.4 A is 
needed, which requires future developments. With aberration- 
corrected STEM***”*, better 3D resolution and image quality should 
be achievable, but extended depth-of-field techniques may have to be 
applied to the tilt series before the EST reconstruction can be per- 
formed. Compared to atom-probe tomography”’, this non-destructive 
technique can not only handle isolated nanoparticles, but also provide 
the 3D local structure of complex nanomaterials at high resolution. 


METHODS SUMMARY 


Gold nanoparticles with a size of ~10 nm (Ted Pella) were supported on 5-nm- 
thick Si membranes (TEMwindows.com), which can withstand plasma cleaning 
for a longer period than carbon substrates to alleviate carbon contamination. 
Tomographic tilt series with equal slope increments were acquired from the gold 
nanoparticles by using an ADF-STEM (FEI Titan 80-300). The tilt angles (0) were 
determined by'®?> 0=—tan~'[(N+2—2n)/N] for n=1, ..., N and 
0=n/2—tan~'|(3N+2—2n)/N] for n= N+1,...,2N with N = 32 or 64 in this 
experiment. The angles beyond +72.6° were not accessible owing to the geometry 
of the support grid. To monitor the radiation induced stability effect, several 
projections at the same particle orientation were measured during the acquisition 
of each tilt series (Supplementary Fig. 6). To improve the 3D reconstruction and 
enhance the signal to noise ratio, the background of the projections was subtracted 
and 2 X 2 pixel binning was performed for each projection. After post data ana- 
lysis, the tilt series was aligned with the centre-of-mass method (Methods). The 
reconstruction of the aligned tilt series was conducted using the EST method, 
which iterated back and forth between real and reciprocal space with constraints 
enforced in real space and measured data in reciprocal space (Methods, 
Supplementary Methods and Supplementary Fig. 2). To examine the reconstruc- 
tion quality, the reconstructed 3D structure was projected back to obtain 69 
projections, which were compared to the corresponding measured ones. An aver- 
age Ryea (Supplementary Methods) was calculated to be 6.7%, indicating a good 
quality reconstruction (Fig. 1, Supplementary Table 1). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Sample preparation. Gold nanoparticle solution with an average particle size of 
~10nm (Ted Pella) was sonicated for ~10 min to prevent aggregation. Five-nm- 
thick Si membranes (TEMwindows.com) were used as the particle substrates in the 
experiment. The thin membrane, with a size of 100 X 1,500 um, is supported on a 
100-j1m-thick Si frame, allowing for a maximum tilt range of +83°. To avoid break- 
ing the membrane, a micromanipulator was used to place a small drop of solution 
onto the outer frame of the Si grid. After gently moving the drop onto the membrane, 
it was removed and not allowed to dry and leave excessive gold particles and con- 
taminants. The Si grids were cleaned pre-deposition in a Gatan Solarus plasma 
cleaner (Model 950) for 20s using a standard H,/O, recipe. To further ensure 
removal of contaminant sources, the sample holder (Fischione Model 2020) was 
plasma-cleaned for an hour before data acquisition using the same recipe. 
ADF-STEM. STEM images of gold nanoparticles were acquired on a FEI Titan 80- 
300 (energy, 300keV; spherical aberration, 1.2mm; illumination semi-angle, 
7.98 mrad; and defocus, 48.6 nm). The 70 pA electron beam was focused to a probe 
with a 50 ,1m probe-forming aperture (C2) and rastered over the sample. The 
scattered electrons were captured by a Fischione Model 3000 ADF detector with 
angles between 13 mrad and 78 mrad from the optical axis. The use of ADF angles 
was to improve the signal to noise ratio with a low current electron beam. The 
effects of nonlinear image intensities and diffraction contrast were carefully deter- 
mined by multi-slice simulations. The maximum tilt angles were limited by the 
holder to 75°. 

Low-exposure acquisition of tomographic tilt series. In order to reduce vibration 
and drift during data acquisition, the sample holder was allowed to settle for one hour 
after insertion into the microscope, and also for several minutes after moving to each 
new angle. Tilt series were acquired by manually changing the angle with equal slope 
increments. Thetilt angles (0) were determined by'** 0 = — tan! [(N +2—2n)/N] 
forn =1,...,Nand0=1/2— tan '[(3N+2—2n)/N] for n = N+1,...,2N with 
N = 32 or 64 in this experiment. To focus each projectional image during data 
acquisition, a nearby particle was used (rather than the particle of interest) to reduce 
the radiation dose to the particle. By using this low-exposure data acquisition 
scheme, we have obtained several tomographic tilt series. Supplementary Fig. 5 
shows the tilt series used in this reconstruction with 69 projections and a tilt range 
of +72.6°. A representative sinogram of the tilt series is shown in Supplementary 
Fig. 13. The probe current was ~70 pA with a dwell time of 45 ls per pixel, and the 
magnification of each projection was 5.2 X 10°. Since the pixel size in STEM mode 
can vary, a calibration image of the particle was taken in TEM mode, and the STEM 
pixel size was determined to be 0.42 A. The total electron dose of the tilt series was 
estimated to be ~7.6 X 10°e A. Supplementary Fig. 6 shows three 0° projections 
measured during the acquisition of this tilt series. Although some minor shape 
changes occurred, the crystal lattices of the particle remained consistent throughout 
the experiment. 

Post data analysis. In order to apply the EST method, the background surround- 
ing the nanoparticle in each projection has to be subtracted. To systematically 
eliminate the background, we first projected all the projections onto the tilt axis 
and obtained a set of 1D curves. We then determined the optimal cut-off value for 
background subtraction in each projection by maximizing the cross-correlation 
among these 1D curves. After background subtraction, we binned 2 X 2 pixels into 
1 pixel, which was used to enhance the signal to noise ratio in the projections and 
improve the EST reconstruction. The background subtracted and binned projec- 
tions with pixel size of 0.84 A were aligned using the centre-of-mass approach and 
reconstructed with the EST method. 

The centre-of-mass alignment method. To achieve an atomic scale resolution 
reconstruction, the projections in a tilt series have to be aligned to a common axis 
(not necessarily the true tilt axis) with atomic level precision in both the x and y 
axes, where the y axis represents the tilt axis and the z axis the beam direction. To 
align the tilt series along the y axis, the projections were first projected onto the y 
axis and a set of 1D curves was generated. We then chose a 1D curve at 0° as a 
reference, and aligned the remaining curves to the reference. To align the projec- 
tions along the x axis, we developed a method based on the centre of mass. When a 
3D object is tilted around the y axis from 0° to 360°, the centre-of-mass of the 
object forms a circle. However, in the special geometry where the centre of mass 
coincides with the origin of the x axis, this circle becomes a point. To determine the 
centre of mass in this special geometry, we projected each 2D projection onto the x 
axis, chose a pixel as the origin and calculated the centre of mass (CM) along the x 


axis, xem = >. x;p(x;) / >> p(x:), where p(x;) is the Coulomb potential at position 


1 v 
x; We then shifted this projection to set xcm as the new origin of the x axis. 
Through repeating this process for all projections, we aligned the tilt series to 
the common axis that coincides with the new origin. Both our simulation and 
experimental results indicate that the centre-of-mass alignment is a general 
method and can align the projections of a tilt series at atomic level accuracy, even 


with relatively high noise and the nonlinear effects (Fig. 1, Supplementary Figs 4 
and 8, and Supplementary Table 1). 

The EST method. When the projections of a tilt series use equal slope increments, 
it has been shown that a direct fast Fourier transform, the pseudopolar fast Fourier 
transform (PPFFT)"', exists between a pseudopolar grid and a Cartesian grid. 
Supplementary Fig. 1 shows a pseudopolar grid and the PPFFT. For an NX N 
Cartesian grid, the corresponding pseudopolar grid is defined by a set of 2N lines, 
each line consisting of 2N grid points mapped out on N concentric squares. The 2N 
lines are subdivided into a horizontal group (in blue) defined by y = sx, where s is 
the slope and |s| = 1, and a vertical group (in red) defined by x = sy, where |s| = 1; 
the horizontal and vertical groups are symmetric under the interchange of x and y, 
and As = 2/N. When these conditions are met, the PPFFT and its inverse algo- 
rithm are mathematically faithful’’. Note that the PPFFT and its inverse algorithm 
were originally developed to interpolate tomographic projections from a polar toa 
Cartesian grid in reciprocal space. The idea of acquiring tomographic tilt-series at 
equal slope increments and then combining the PPFFT with iterative algorithms 
for 3D image reconstructions was first suggested in 2005 (ref. 25). 

Compared to other data acquisition approaches, such as the Saxton scheme”, the 

EST data acquisition approach is different in that it acquires projections with equal 
slope increments in order to use the PPFFT. Although the PPFFT and its inverse 
provide an algebraically faithful way to do the fast Fourier transform between the 
Cartesian and pseudopolar grids, three difficulties limit its direct application to 
electron tomography. First, the tilt range has to be from —90° to +90°. Second, the 
number of projections in a tilt series needs to be 2N for an N X N object. Third, the 
grid points past the resolution circle (dashed circle in Supplementary Fig. 1) cannot 
be experimentally determined. We overcame these limitations by combining the 
PPFFT with an iterative process'*'* *°”°. Supplementary Fig. 2 shows the schematic 
layout of the iterative EST method. We first convert the electron micrograph 
projections to Fourier slices in the pseudopolar grid. As illustrated in 
Supplementary Fig. 1, the distance between the sampling points on the individual 
2N lines of the pseudopolar grid varies from line to line. In order to calculate the 
Fourier slices from the projections, the fractional Fourier transform is used to vary 
the output sampling distance of the Fourier slices*’. By applying the inverse PPFFT, 
we obtain a 3D image in real space. A 3D support is defined to separate the object 
from a zero region where the size of the zero region is proportional to the over- 
sampling of the projections**. The negative-valued voxels inside the support and 
the voxel values outside the support are set to zero, and a new 3D image is obtained. 
The forward PPFFT is applied to the new image and set of calculated Fourier slices 
is obtained. We then replace the corresponding calculated Fourier slices with the 
measured ones, and the remaining slices are kept unchanged. The iterative process 
is then repeated with each iteration monitored by an Ryecjp (Supplementary 
Methods). The algorithm is terminated after reaching a maximum number of 
iterations. A more detailed description of the EST method is presented in 
Supplementary Methods. Compared to phase retrieval in coherent X-ray/electron 
diffraction imaging”’**~*’, the EST method aims to solve the missing data by com- 
bining an iteration process with the PPFFT algorithm. 
Identification of the major 3D grains inside the nanoparticle. The following 
procedures were used to determine the major 3D grains inside the gold nanopar- 
ticle. (1) Apply the 3D Fourier transform to the reconstructed nanoparticle and 
identify the Bragg peaks corresponding to a major grain. (2) Use small spheres 
with soft edges to select these Bragg peaks and set other values to zero. (3) Apply 
the 3D inverse Fourier transform to the selected Bragg peaks and obtain a 3D 
image. (4) Convolve the 3D image with a Gaussian filter and choose a cut-off value 
to determine the 3D shape of the grain. (5) Use the 3D shape to identify the 
corresponding 3D grain in the reconstructed nanoparticle. (6) Repeat steps 1-5 
to determine other major grains. 
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Large-scale vortex lattice emerging from collectively 


moving microtubules 


Yutaka Sumino'*, Ken H. Nagai**, Yuji Shitaka*, Dan Tanaka*t, Kenichi Yoshikawa’, Hugues Chaté® & Kazuhiro Oiwa*”’ 


Spontaneous collective motion, as in some flocks of bird and 
schools of fish, is an example of an emergent phenomenon. Such 
phenomena are at present of great interest’ ° and physicists have 
put forward a number of theoretical results that so far lack experi- 
mental verification®*. In animal behaviour studies, large-scale 
data collection is now technologically possible, but data are still 
scarce and arise from observations rather than controlled experi- 
ments. Multicellular biological systems, such as bacterial colonies 
or tissues”’°, allow more control, but may have many hidden 
variables and interactions, hindering proper tests of theoretical 
ideas. However, in systems on the subcellular scale such tests may 
be possible, particularly in in vitro experiments with only few 
purified components‘ '*. Motility assays, in which protein filaments 
are driven by molecular motors grafted to a substrate in the presence 
of ATP, can show collective motion for high densities of motors 
and attached filaments. This was demonstrated recently for the 
actomyosin system’*’’, but a complete understanding of the 
mechanisms at work is still lacking. Here we report experiments 
in which microtubules are propelled by surface-bound dyneins. In 
this system it is possible to study the local interaction: we find that 
colliding microtubules align with each other with high probability. 
At high densities, this alignment results in self-organization of the 
microtubules, which are on average 15 pm long, into vortices with 
diameters of around 400 pm. Inside the vortices, the microtubules 
circulate both clockwise and anticlockwise. On longer timescales, 
the vortices form a lattice structure. The emergence of these struc- 
tures, as verified by a mathematical model, is the result of the 
smooth, reptation-like motion of single microtubules in combina- 
tion with local interactions (the nematic alignment due to colli- 
sions)—there is no need for long-range interactions. Apart from 
its potential relevance to cortical arrays in plant cells'*’” and other 
biological situations, our study provides evidence for the existence 
of previously unsuspected universality classes of collective motion 
phenomena. 

Our experimental in vitro motility assay consisted of dynein motor 
proteins grafted to a glass surface at a density of 750-2,500 molecules per 
square micrometre and to fluorescently labelled, stabilized microtubules 
(details can be found in Supplementary Methods). We used inner-arm 
dynein subspecies c (dynein c) purified from Chlamydomonas flagella. 
This drove the microtubules, with their plus ends leading, at a mean 
velocity of ~12 um s | in vitro at 23 °C in the presence of 1mM ATP, 
while rotating them around their longitudinal axis in a clockwise 
direction as viewed from their minus ends’*”’. (‘Plus’ and ‘minus’ here 
refer to microtubule polarity.) When introduced in a flow cell, the 
microtubules did not move and bound to the dynein-c-coated glass 
surface in the absence of ATP’*””. On addition of 1 mM Mg-ATP, they 
started moving smoothly on the surface at a fairly constant speed. No 
preferred direction of isolated microtubule motion was observed. The 


smoothness and isotropy of motion reflect the fact that relatively 
many, randomly oriented, dyneins were attached at any time to a given 
microtubule. 

Using microtubules at a concentration of 40 pg ml ', which is a few 
times greater than in conventional assays’*, we found the density of 
attached microtubules to be 0.05 um? (there were five microtubules, 
15.6+7.3 um in length (mean + s.d., n = 257), per 100 um?). On 
addition of ATP (at 0 min), we observed many aligning collisions 
(discussed further below) that gradually led to streams along which 
dozens of microtubules moved in both directions. The size of a stream 
increased in time (by more than 10 tm in width and more than 100 um 
in length) (12 min; Fig. 1a). The streams started to meander over the 
surface typically after 10 min. At 10-20 min, some vortices appeared 
(17 min; Fig. la and Supplementary Movie 1) and eventually covered 
the flow cell almost entirely, except for the area less than 500 j1m away 
from the boundaries (Fig. 1b). The shape of the vortices gradually changed 
over time but their diameters were almost constant, at 400 um (major 
axis, 443 + 64 ttm (s.d., 1 = 40); minor axis, 390 + 59 um (s.d., 2 = 40)). 
At this late stage, the vortices showed a tendency to arrange their 
positions into a hexagonal lattice, although not perfectly. A given vortex 
consists of a sparse core and a dense peripheral annulus inside which 
microtubules move both clockwise and anticlockwise in small streams 
and slide past each other (Fig. 2 and Supplementary Movie 2). That is, 
the streams show nematic rather than polar order. Microtubules were 
never trapped in a vortex: they would circulate inside one vortex for 
some time before moving to a neighbouring one or travelling farther 
and starting to revolve around a more distant core. 

As the first step in understanding vortex formation, we examined the 
interactions between microtubules. We decreased their density to 0.5- 
1 pg ml ' to focus on pair interactions between isolated microtubules. 
We examined 393 pair collision events. No significant interaction was 
detected until the pair collided, indicating that long-range, ‘hydro- 
dynamic’ effects are negligible. On collision, strong interactions due 
to volume exclusion occurred overwhelmingly (80% of events), 
leading either to alignment or anti-alignment (70%; Fig. 3a, b) or to 
the stoppage of one microtubule to prevent it from hitting the other 
(10%; Fig. 3c). In the remaining fraction of events (20%), the micro- 
tubules crossed each other with little effect on their trajectories 
(Fig. 3d, e; see also Supplementary Movie 3). In aligning and anti- 
aligning collisions, the trajectory of one microtubule undergoes a sharp 
turn (Fig. 3a, b) and alignment was observed to be near perfect. We 
believe that this is due to the leading end of the microtubule being 
relatively free from attached dyneins by comparison with rest of it: on 
meeting an obstacle, the tip of the microtubule can then bend more 
easily. Thus, the outgoing angle is near 0 or 1 in aligning events, 
irrespective of the value of the incoming angle, ¢. In stoppage and 
crossing events, the orientations of both microtubules remain roughly 
unchanged. (During stoppage events, one microtubule simply ‘waits’ 
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14.5 min 


Figure 1 | Emergence of vortices of microtubules. False-colour images. 

a, Formation process of vortex pattern: 0 min corresponds to the time of 
injection of ATP. Streams appeared around 5 min after this injection. At 

12 min, streams started to meander. Vortices started to appear around 17 min, 
when meandering streams contacted one another. Once vortices had formed, 
they grew steadily (22 min). Scale bar, 500 um. b, Large-scale lattice of vortices. 
Vortices can be observed everywhere on the surface of the flow cell. Three air 
bubbles in the flow cell can be seen distinctively owing to their greater size and 
thicker edges. Scale bar, 2mm. 


until the other has passed (Fig. 3c).) Given that the outcome of each 
type of collision is well defined, we represented the complete collision 
statistics in terms of the probability of each collision type as a function 
of ¢ only (Fig. 3d). We observe an overall symmetry around ¢ = 1/2, 
with alignment and anti-alignment occurring mainly for small (¢ ~ 0) 
and large (# ~ m) angles, respectively, whereas stoppage and crossing 
dominate for ¢ ~ 1/2. Thus, we conclude that near-perfect nematic 
alignment is induced by direct collision. 


Figure 2 | Nematic order inside a vortex stream. In this experiment, the 
microtubules were all stained fluorescent but approximately 0.1% of 
microtubules were stained to be more fluorescent than the rest. a, Close-up of 
vortex. Scale bar, 20 um. b, Spatiotemporal diagram of a magnified region of the 
vortex shown in a. Yellow lines indicate microtubules travelling from top to 
bottom, and blue lines indicate microtubules travelling from bottom to top. 
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Figure 3 | Collision between microtubules. a-c, Three different types of 
collision between two microtubules. In a and b, microtubule 2 collides with the 
side of microtubule 1 and changes its direction of motion abruptly, leading to 
almost perfect alignment (a) or anti-alignment (b). In c, microtubule 2 stops 
until microtubule 1 has passed. Scale bar, 10 tum. d, Probability of each 
interaction type as a function of # (see right-most images in a-c). We analysed 
393 events. Red lines represent alignment (solid) and anti-alignment (dotted) 
events; blue dotted lines represent stopping events; yellow solid lines represent 
crossing-over events. e, Total number of observations in each angular bin. 


Constant-speed particles aligning nematically in the presence of 
noise have been studied recently in mathematical models of self- 
propelled rods, which were shown to give rise to unexpected collective 
dynamics but not to vortical structures’. Our motility assay has 
properties not present in simple models of colliding rods. One such 
feature is the nature of the motion of isolated microtubules: as already 
mentioned, their reptation-like movement yields smooth but con- 
stantly turning trajectories, unlike both straight ballistic motion and 
the ragged behaviour of random walkers submitted to rotational dif- 
fusion, the individual dynamics adopted in rods models*”**. We 
analysed the trajectories of isolated microtubules for durations of up 
to 350 s (Fig. 4a, left). Here we further decreased their concentration, to 
4.8ngml_'. The speed (tangential velocity) of the microtubules 
fluctuated in time, with a mean of vp = 8.75 ums — 1 Wealso measured 
the direction of motion, 0 (where 0 =0 and 1/2 correspond to the 
direction of the x and y axes, respectively), and the instantaneous 
angular velocity, and found them to have no significant correlation 
with the instantaneous speed. This allowed us to neglect the speed 
fluctuations and focus on 0. 

A closer look at a representative trajectory (Fig. 4a, right) reveals 
small-amplitude transverse oscillations, which might be caused by the 
rotation of the microtubule around its axis’*. After an appropriate filter- 
ing, based on the Savitzky—Golay method, to eliminate these oscillations 
(Supplementary Methods), we obtained the variation of curvature, 
« = d0/ds, asa function of distance, s, along the trajectory and calculated 
its autocorrelation (Fig. 4b). (With this definition, « can be positive 
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Figure 4 | Motion of isolated microtubules. a, Left: trace of an isolated 
microtubule. Right: enlarged image of boxed area on left. The green and blue 
filled circles correspond to the start and end points of the observation. The open 
blue dots represent the position of the microtubule, and the red solid line 
corresponds to the smoothed curve. b, Autocorrelation of the trajectory 
curvature, «, with respect to the distance along a trajectory, s (red solid line). 
The green dotted line is an exponential fit with the correlation length 

So = 542 + 4m. ¢, Distribution of « values observed (bars) and Gaussian fit 
(dashed blue line): mean, Ky = —7.1 X 10-4 m7 };s.d.,0,.= 1.8 X 10-7 m7 !. 


(anticlockwise motion) or negative (clockwise motion).) Fitting the 
result with the exponential function exp(—s/sy), we found the persist- 
ent length, so, of curvature to be of the order of 500 tm. The distri- 
bution of « values observed is roughly Gaussian with mean 
Ko = —7.1 X 10° * um and standard deviation o,. = 1.8 X 107° um? 
(Fig. 4c). We note that these numbers correspond to length scales that 
are rather large and of the same order as the vortex diameter, and that 
the small negative value of ko indicates a preference for clockwise 
turning in single-microtubule trajectories, which is probably a con- 
sequence of the clockwise rotation around of the microtubules around 
their axes while in motion”. A detailed molecular-level study should 
confirm this. 

We constructed a mathematical model for the collective motion of 
microtubules in our experimental motility assay by incorporating the 
few ingredients uncovered in our analysis of the trajectories of single 
microtubules and of pair collisions. A deliberate choice of minimality 
was made, to stress that with only these very few features all observed 
collective phenomena are reproduced, and to allow for numerical simu- 
lations of millions of objects: for instance, the dynein motors driving the 
microtubules are not modelled explicitly; the hydrodynamic inter- 
actions induced by the fluid are neglected, in agreement with our 
observation that only direct contact modifies the microtubules’ motion. 
The microtubules themselves are not represented as flexible polymers: 
here, in the spirit of ref. 6, they are identical point particles moving at 
constant speed, vo (the speed fluctuations are also neglected). In direct 
agreement with the observations reported in Fig. 4, the free motion of 
particle i, with position x; and orientation 0;, is a biased Ornstein- 
Uhlenbeck process, w(t), with correlation time t = so/Vvo, acting on 
the particle’s instantaneous angular velocity, d0;/dt: 


dx; 
Te Vo(e, cos 0; +e, sin 0;) 


a (1) 
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da; 


P co) + C(t) (2) 


where e, and e, are respectively unit vectors in the x and y directions, 
é(t) is a Gaussian white noise with zero mean and variance 
o =2v02 / T=2viar /so, and p= VoKo is the preferred non-zero 
angular velocity induced by the mean curvature, Ko. This ensures that 
all correlation properties of the curvature measured in the experiment 
are reproduced (Supplementary Methods). 

These equations must be complemented with an interaction term. 
Only the nematic alignment collisions are modelled (because stoppage 
and crossings do not substantially modify the microtubule orienta- 
tions). As they were observed to induce much sharper turns in the 
trajectories (Fig. 3a, b) than the spontaneous curvature changes of free 
microtubules (Fig. 4a), these collisions are modelled as a nematic 
alignment term acting directly on the particle orientation 0j, that is, 
in equation (1), and not on the angular velocity, «;. Following previous 
studies®”°, we replaced explicit pair collisions by an effective alignment 
within an interaction range, /, which is naturally chosen to be the 
typical microtubule length. Equation (1) is thus replaced by 


1 
(a; 
T 


dx; : 

a Vo(€x cos 0; +e, sin 0;) 
3) 

d6; 

rae + pn 0;)) 


where «& is a parameter and the sum is over the n,(t) particles j within 
distance / of particle i at time t. 

Numerical simulations of equations (2) and (3) using experimentally 
obtained parameters (Supplementary Methods) show the emergence, 
from disordered initial conditions, of large-scale vortices with a diameter 
of the same order as those observed in experiments (Fig. 5a and Sup- 
plementary Movie 4). 

Inside vortices, the simulated particles revolved both clockwise 
and anticlockwise, with the majority moving clockwise, as expected 
from the non-zero value of wp used. The number of vortices increased 
over time and saturated, leading eventually to a regular hexagonal 
lattice structure. As in our experimental observations, vortices did 
not form at low densities. To see this more clearly, we calculated 
u’(n), the variance of the local density in small boxes containing n, 
particles on average, as a function of the rescaled average density, p, 
(the actual density multiplied by ?; Supplementary Methods), the 
other parameters being kept fixed (Fig. 5b). For p, <0.4, the system 
remains disordered and u*(n)/n, is almost constant, indicating the 
absence of density structures (Fig. 5b, right, images (i) and (ii)). 
However, u?(n)/n, increased with pa for p,>0.4, indicating the 
appearance of a density pattern, which we checked visually to be a 
vortex lattice (Fig. 5b, right, images (iii) and (iv)). This result is con- 
sistent with our experimental observation that the vortex pattern dis- 
appears when the density of microtubules is halved (compare with 
Supplementary Fig. 1). We also performed numerical simulations in 
which we varied the rescaled correlation time, 2 = vot/I (Supplemen- 
tary Methods), but kept all the other parameters fixed. Global nematic 
order without a density pattern was observed for A < 60, but the system 
showed vortex formation for 2 > 60 (Fig. 5c). Thus, the large persist- 
ence length and time of single-microtubule motion is necessary for the 
emergence of the vortices. Further information on the phase diagram 
of our mathematical model can be found in Supplementary Methods 
and Supplementary Fig. 2. 

The study of our mathematical model thus shows that smooth 
trajectories of single microtubules and their direct collisions inducing 
sharp bends and nematic alignment—both experimentally observed— 
are sufficient, when associated with self-propulsion, to account for all 
observed collective properties. In particular, there is no theoretical 
reason to invoke, nor is experimental evidence of, long-range ‘hydro- 
dynamic’ interactions or other ‘collective effects’ in our system. By 
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Figure 5 | Collective dynamics of mathematical model of interacting, self- 
propelled particles. See equations (2) and (3). a, Spontaneous formation of a 
vortex reproduced by our model. Each image is averaged for AT = 10, where T 
is simulation time rescaled by //vo. The brightest area corresponds to a density 
of 15 particles per unit area. Green and red respectively correspond to the 
densities of particles moving clockwise (@ < 0) and anticlockwise (« > 0). 
Yellow regions indicate areas with clockwise and anticlockwise motion in 
approximately equal proportions. The area of the figure is 512 X 512 
simulation cells. The last snapshot, at T = 19,900, shows the particle densities 
without differentiating between clockwise and anticlockwise motion, to make 
comparison with experimental results easy. b, Global nematic order parameter, 
S, and variance of the local density, u’(n) (calculated over boxes 4 simulation 
cells in linear size and AT = 10), divided by the average density for the box, n,, 
plotted against p,. Corresponding density distributions are shown in images 
(i)-(iv) (each corresponds to a different value of p, as shown, and is a sum over 
5,000 snapshots taken at intervals of AT = 1 ina 256 X 256 system). Here we set 
the rescaled correlation time, 1 = vot/I, to 2 = 100. ¢, Same as b, but plotted 
against A with p, = 10. The density distributions correspond to different values 
of 2, as shown. 


contrast, such interactions were relied on to explain some of the 
collective phenomena observed in the actomyosin system’. We 
believe that our observations are a consequence of using dynein c 
motors; in fact, when kinesin was used instead of dynein, microtubules 
mostly crossed each other and vortex formation could not be observed 
(Supplementary Fig. 3). With dynein, collisions are frequent enough to 
induce large-scale order. We suppose that this is due to the mechanical 
properties of dynein molecules and the asymmetry of their molecular 
structure. The formation of vortices arises from the smoothness and 
the curvature persistence of trajectories of single microtubules, both of 
which properties are, we believe, the consequence of having a relatively 
large number of motors attached at any time (we estimate it to be of the 
order of 100). 

The vortices observed in our motility assay possess unique features: 
their large scale (relative to the length of microtubules), the irrelevance 
of hydrodynamic interactions and the dominantly nematic order 
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inside them distinguish them from the polar, small-scale vortices 
formed by swimming sperm cells confined in quasi-two-dimensional 
conditions™. Their stability in time, and their well-defined diameters 
also distinguish them from the unstable ‘swirling patterns’ reported in 
the actomyosin assay’*. From a theoretical viewpoint, we are not aware 
of other models showing a large-scale vortex pattern of the type found 
here. In our model, the crucial feature introduced is the combination of 
smooth, reptation-like motion and sharp nematic alignment. This 
combination can be seen as the signature of a new class in the emerging 
landscape of universality classes of collective motion phenomena. 

Returning to the biological relevance of our findings, we believe that 
they stress further the message, in various in vivo situations and in 
particular in plant cell cortical arrays'®’’*°’°, that simple physical 
collisions can induce the reorientation of microtubules, creating 
large-scale structures even in the absence, here, of crosslinkers’””*. 
Furthermore, there is a striking analogy between our analysis of micro- 
tubule collisions and that performed in plant cell cortex'””°”’, although 
there is only treadmilling and no actual displacement of microtubules 
in the cortex. As shown in Fig. 3d, in our system the probability of 
aligning collisions is high at small incoming angles (¢ = 0 mod m) and 
decreases monotonically from 0 to 1/2, as for “zippering’ (positive-end 
entrainment) of cortical microtubules. 

Well-controlled in vitro experiments of the type reported here 
present crucial advantages due to the simplicity of the objects involved 
and the relative ease of collecting large amounts of data: not only can 
theoretical ideas be tested, but such experiments can reveal the import- 
ance of mechanisms previously ignored, and allow for quantitative 
analysis of features also present in vivo. They are and will remain 
invaluable tools for our understanding of active matter and biological 
organization processes. 


METHODS SUMMARY 


We prepared inner-arm dynein and tubulin using standardized protocols’”. Cy3- 
labelled microtubules were polymerized from a mixture of Cy3-tubulin and non- 
fluorescent tubulin and were stabilized using Taxol. Fractionated dynein c was 
diluted at various concentrations (27-90 g ml‘) in buffer solution containing 
30 mM HEPES/KOH (pH 7.4), 5 mM MgSO,, 1 mM DTT, 1 mM EGTA and | mg 
ml | bovine serum albumin. The flow cell was flushed with the diluted dynein 
solution, incubated for 5 min and then washed using the buffer solution. Solution 
containing microtubules as well as 1mM ATP and 1mM DTT was introduced 
into the flow cell. Microtubules were visualized on an epifluorescence microscope. 
The vortex pattern, binary collisions and individual trajectories were observed 
with solutions of microtubules with respective concentrations of 40 Lg ml, 
0.5-1p1gml~' and 4.8ngml'. For single-trajectory analysis, the position of a 
microtubule was measured every At = 0.68 s. 
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Collapse of polar ice sheets during the stage 11 


interglacial 


Maureen E. Raymo! & Jerry X. Mitrovica? 


Contentious observations of Pleistocene shoreline features on the 
tectonically stable islands of Bermuda and the Bahamas have 
suggested that sea level about 400,000 years ago was more than 20 
metres higher than it is today’*. Geochronologic and geomorphic 
evidence indicates that these features formed during interglacial 
marine isotope stage (MIS) 11, an unusually long interval of warmth 
during the ice age’*. Previous work has advanced two divergent 
hypotheses for these shoreline features: first, significant melting of 
the East Antarctic Ice Sheet, in addition to the collapse of the West 
Antarctic Ice Sheet and the Greenland Ice Sheet'*; or second, 
emplacement by a mega-tsunami during MIS 11 (ref. 4, 5). Here 
we show that the elevations of these features are corrected downwards 
by ~10 metres when we account for post-glacial crustal subsidence of 
these sites over the course of the anomalously long interglacial. On 
the basis of this correction, we estimate that eustatic sea level rose to 
~6-13 m above the present-day value in the second half of MIS 11. 
This suggests that both the Greenland Ice Sheet and the West 
Antarctic Ice Sheet collapsed during the protracted warm period 
while changes in the volume of the East Antarctic Ice Sheet were 
relatively minor, thereby resolving the long-standing controversy 
over the stability of the East Antarctic Ice Sheet during MIS 11. 

The stability of ice sheets in the face of continuing global warming is 
an issue of significant societal concern. Satellite gravity measurements 
indicate that the Greenland Ice Sheet (GIS) and the West Antarctic Ice 
Sheet (WAIS), the two ice sheets most susceptible to climate change, 
are experiencing a net mass loss*”, with evidence of an accelerating 
pace’ '*. In contrast, the current mass balance of the much larger East 
Antarctic Ice Sheet (EAIS) is uncertain, even in sign®”, though a recent 
study" has inferred EAIS mass loss localized to coastal regions. This 
uncertainty about the stability of the EAIS in a progressively warming 
world has been a key motivation for studies of the palaeoclimate record 
during past warm intervals. 

One such study, a statistical analysis of widely distributed sea-level 
markers related to the last interglacial (MIS 5e; about 120,000 years ago), 
concluded with 95% confidence that eustatic sea level (ESL; defined as 
the globally averaged sea-level change) was >6.6 m higher during MIS 
5e than at the present day, and with 66% confidence that ESL was 
>8.0m higher’’. (This inference, higher than earlier estimates”, is 
supported by a recent analysis of MIS 5e sea-level records from 
Florida’*.) Estimates of the ESL rise associated with collapse of polar 
ice sheets range from 3.4 m (ref. 16) to 7 m for the GIS, and from 3.2 m 
(ref. 17) to 5m for the WAIS, where the upper bounds refer to the 
complete disappearance of the ice sheet. Thus, whereas the estimate of 
peak ESL during MIS 5e implies significant collapse of both the GIS 
and the WAIS, it also implies that the EAIS remained relatively stable. 

It is within this context of assessing potential future instability of the 
EAIS that the sea-level highstand features found at ~20 m (here and 
elsewhere, height above present-day sea level is meant) in Bermuda 
and the Bahamas, and which formed during the MIS 11 interglacial 
(~424-395 kyr ago), have taken on great significance. MIS 11 spanned 
two precession cycles and was the longest interglacial of the past 


500 kyr (refs 18, 19), including the current interglacial MIS 1 (Fig. 1) 
and MIS 5e (Supplementary Fig. 4). If the ESL during the MIS 11 
interglacial peaked at a level 20 m higher than today’, then at least 
8 m of that rise must have come from melting of the EAIS. Geologic 
evidence for a ~20-m sea-level highstand in Bermuda and the 
Bahamas is convincing. In Bermuda, reasonably well-dated deposits 
with thalassinidean shrimp burrows, foraminifera, and gastropods 
characteristic of littoral and intertidal environments constrain relative 
sea level at 21.3 + 1.0m during MIS 11 (refs. 2, 3). On Eleuthera, in the 
Bahamas, a gently sloping erosion surface capped with fenestrae-rich 
intertidal beach deposits provides a maximum sea-level estimate of 
20 + 3m, and the occurrence of pendant fibrous cements suggests a 
minimum sea level of 17 + 2 m (we will henceforth quote a sea-level 
estimate of 18.5 + 3.6 m for this site); multiple dating methods suggest 
that these deposits were formed during MIS 11 (ref. 1). 

How do these observations compare to other MIS 11 sea-level 
indicators or proxies? In a recent survey of MIS 11 sea-level records 
worldwide (most of which are located in tectonically active regions), 
Bowen’ estimated peak MIS 11 sea level using a range of tectonic uplift 


Time (kyr ago) 
-20 -10 0 10 20 30 
3.0L Te! 
0 
o 
2 
2 3.5- 
= = 
(eo) 40 § 
; a 
2 40h m 
E Z 
n = 
3 

2 -80 = 
3 45 
Pos —— LR04 MIS11 
pa 
& —— LR04 MISs1 
=) || eae Sea-level model 120 

5.0- 

| | | | 
390 400 410 420 430 440 
Time (kyr ago) 


Figure 1 | Comparison of the duration of the MIS 11 and MIS 1 
interglacials. Plot of the LR04 benthic oxygen isotope stack’* (left-hand 
vertical axis) over a time window spanning the MIS 11 (blue; bottom time scale) 
and MIS 1 (red; top time scale) interglacials. The mean standard error on 5180 
in the LR04 stack is 0.06%o with an age error of +4 kyr for the intervals 
considered here. The juxtaposition illustrates the significantly longer duration 
of maximum interglacial conditions during MIS 11 relative to MIS 1. ESL 
associated with the model ice history used to calculate GIA effects during MIS 
11 is shown by dashed line (right-hand vertical axis). Note the hiatus in model 
ice volume changes from 410 to 401 kyr ago (black bar). An analogous 
comparison between the duration of MIS 11 with MIS 5e can be found in 
Supplementary Fig. 4. 
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corrections based on different assumed ages and elevations of MIS 5e 
sea-level markers. If we adopt Kopp et al.’s’* recent inference of the age 
and peak magnitude of ESL during MIS 5e as a constraint on the latter, 
then Bowen’s estimates from far-field sites imply that peak sea level 
during MIS 11 was 6-9 m above the present-day value, an estimate 
consistent with marine 5'°O isotopic anomalies*’. However, Bowen 
excluded a number of outliers, including the well-studied Bermuda 
and Bahamas sites, and following McMurtry et al.*, he attributed the 
anomalously high elevation of shoreline features at those sites to 
deposition by storms or a mega-tsunami. 

The two explanations for the MIS 11 highstand features in Bermuda 
and the Bahamas—global sea-level rise associated with significant 
EAIS melting’, or mega-tsunami deposition**—ignore the potential 
signal from glacial isostatic adjustment (GIA). The GIA signal may be 
significant at the two sites, as they both are located on the peripheral 
bulge of the ancient Laurentian ice complex. As a consequence, they 
would be subject to local crustal subsidence and sea-level rise of ampli- 
tude ~1-2 mmyr ' during any interglacial". 

To explore the potential contribution from GIA to MIS 11 highstand 
elevations, we compute global sea-level variations over the past 500 kyr 
using a gravitationally self-consistent theory valid for spherically 
symmetric, linear viscoelastic Earth models” (see Supplementary 
Information). As an illustration of the physics of interglacial sea-level 
trends, we plot the predicted change in sea level across the warmest, 
most stable interval in MIS 11 (as implied by 5'°O records), a 9-kyr 
period spanning 410-401 kyr ago that occurred during the second half 
of MIS 11 (Fig. 2). The predictions adopt an Earth model with upper- 
mantle and lower-mantle viscosities of 5 X 107° Pas and 5 X 10*! Pas, 
respectively (model LM”***; see Supplementary Information). 

In the near-field of the former MIS 12 ice sheets, the predicted sea- 
level change is dominated by radial crustal motions. For example, in 
regions once covered by ice sheets, post-glacial rebound of the crust 
produces a sea-level fall of amplitude up to ~1-2 cm yr _', leading toa 
net fall over 9 kyr that can exceed 100 m (these amplitudes are well off 
the scale of Fig. 2). Surrounding these regions of uplift are peripheral 
bulges predicted to subside at rates of up to 0.2-0.3 cmyr_‘ during the 
interglacial. This subsidence accounts for the (red) zones of sea-level 
rise in Fig. 2, with maximum amplitude of ~20m, that encircle the 
Laurentian, Fennoscandian and Antarctic ice complexes. In contrast, a 
sea-level fall of 2-3 m is predicted across most of the far-field of the 
former ice sheets during MIS 11. This fall is due to a combination of 
two processes: (1) deglaciation-induced ocean loading effects, which 
act, near continental margins, to tilt the crust such that continents 
are deformed upward (that is, sea level falls) and offshore regions 
downward’; and (2) the redistribution—or syphoning—of water 
from ocean basins towards regions of peripheral bulge and offshore 
subsidence”. 
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Figure 2 | Predicted sea-level change across the hiatus in ice volume changes 
(410-401 kyr ago) spanned by the model MIS 11 interglacial. The GIA 
calculation is based on the ice history discussed in Supplementary Information 
and the LM” viscosity model. The colour scale saturates in regions within the 
near-field of the late Pleistocene ice sheets. 
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Following Fig. 2, we conclude that in the absence of interglacial ice 
volume changes and steric effects, sea-level highstands will date to the 
end of the interglacial at sites within the subsiding peripheral bulge of 
major ice centres, and to the beginning of the interglacial over most of 
the far-field (Fig. 3). (Of course, any sea-level signal associated with 
interglacial ice volume variations and with ocean temperature and 
salinity changes will be superimposed on these predicted GIA trends.) 
Consider Bermuda: the predicted relative sea-level history for this site 
(Fig. 3 and Supplementary Fig. 1, solid red line) exhibits a monotonic 
rise across MIS 11, reaching a level 11.9 m above the present-day value. 
The predicted highstand for the Bahamas (dashed red line, Fig. 3), 
lying on the outer flank of the same peripheral bulge, is 7.4m. It is 
clear that the amplitude of highstands at Bermuda or the Bahamas, 
indeed at all sites within a peripheral bulge, will be a strong function of 
the duration of the interglacial highstand. In this regard, the protracted 
length of MIS 11, relative to the Holocene (Fig. 1) or MIS 5e (Sup- 
plementary Fig. 4), should be manifested by highstands of particularly 
large magnitude. From this it also follows that simple field com- 
parisons of the relative elevations of Holocene, MIS 5e and MIS 11 
sea-level markers on these islands would lead to an erroneous assess- 
ment of the difference in ESL between those times. 

This result may be stated with more generality. The elevation of an 
ancient interglacial highstand at any site within a peripheral bulge will 
be governed by the difference in the state of isostatic disequilibrium at 
the end of the ancient interglacial relative to the disequilibrium at the 
present day in that same location. As the MIS 11 interglacial was sig- 
nificantly longer than the current interglacial (or MIS 5e), more sub- 
sidence had occurred by the time the final MIS 11 shoreline indicators 
were emplaced on those islands than during the current (or MIS 5e) 
interglacial. Further, as the current interglacial proceeds, sites like 
Bermuda and the Bahamas will be subject to a continuing sea-level rise 
as the present peripheral bulge continues to subside. Thus, the height of 
these ancient MIS 11 highstand features will progressively fall as time 
progresses (even in the absence of any future change in ice volume). 

By contrast, the highstand amplitude at sites located in the far field 
will depend on the difference in isostatic disequilibrium at the beginning 
of the interglacial relative to the disequilibrium at the present day. 
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Figure 3 | Predicted relative sea-level changes across the model MIS 11 
interglacial. The calculations are based on the ice history discussed in 
Supplementary Information and the LM viscosity model. The top two curves 
are predictions for Bermuda (solid red line) and the Bahamas (dashed red line) 
at locations of published data discussed in text. The lower coloured curves, 
predictions for a number of far-field sites, are included for comparison: the Gulf 
of Aden, Red Sea (brown), Curacao (green), Coorong, South Australia 
(magenta), Oahu, Hawaii (blue) and Cape Town, South Africa (cyan). The 
black line shows the ESL variation associated with the adopted ice history (as in 
the dashed blue line in Fig. 1 and the black line in Supplementary Fig. 1) and the 
thin horizontal dashed line references to present-day sea level. 
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Thus, perhaps counter-intuitively, the current elevation of these 
highstands does not depend on the duration of the ancient interglacial. 

Global predictions of current MIS 11 shoreline elevations within each 
of the two zones discussed above are shown in Fig. 4a and b: Fig. 4a 
encompasses sites where the highstand occurred at the end of the 
modelled interglacial hiatus in ice mass change (for example, 401 kyr 
ago) whereas Fig. 4b encompasses sites where the highstand occurs at 
the beginning of the modelled hiatus (410 kyr ago). These results 
represent the predicted contribution to observed MIS 11 highstands 
from GIA alone; this contaminating signal should be removed from 
geological observations before they are used to infer the difference in 
ESL (or ice volume) between MIS 11 and the present. 

As noted above, the GIA contribution to the MIS 11 shoreline 
elevations in Bermuda and the Bahamas is predicted to be 11.9m 
and 7.4m, respectively (Fig. 3). Correcting the published highstand 
estimates at these sites for these GIA predictions yields residual MIS 11 
sea-level elevations of 9.4+ 1m for Bermuda and 11.1+3.6m in 
Eleuthera (Supplementary Table 1). Although additional field obser- 
vations in the Bahamas could greatly decrease the multi-metre error 
associated with these sea-level estimates, we conclude that no signifi- 
cant contribution from melting of the EAIS is required to explain the 
geologic observations at the two sites. 

In our analysis, we implicitly assume that melting of the WAIS and 
the GIS occurred towards the end of the MIS 11 interglacial, such that 
the contribution of the melt event to sea level at Bermuda and the 
Bahamas could be added to a coeval GIA-induced highstand. This 
timing for ice sheet collapse is strongly supported by at least two 
arguments. First, if the collapse took place at the beginning of the 
hiatus in MIS 11 melting (that is, at ~410 kyr ago), then the associated 
ESL rise would have had to be close to 20 m in order to fit the observed 
highstand elevations at Bermuda and the Bahamas. However, in this 
case, the predicted highstand elevation at far-field sites would have 
been even greater than 20 m (that is, an ~20-m melt signal added to 
the GIA-induced highstand ofa few metres; Fig. 3), a situation apparently 
ruled out by far-field observations’. Second, GIA calculations predict a 
submergence of up to several metres for far-field sites by 401 kyr ago 
(Fig. 3) and an average submergence of ~1 m at the specific far-field 
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Figure 4 | Plot of the predicted GIA-induced present-day elevation of MIS 
11 highstands that are currently greater than zero. Elevation is shown for all 
sites in which sea-level peaks either at the end of the hiatus in ice volume 
changes spanned by the model MIS 11 interglacial (a; that is, at 401 kyr ago; see 
also Supplementary Fig. 1) or at the beginning of that hiatus (b; that is, at 
410 kyr ago). 
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sites considered by Bowen’. Therefore, correcting an inferred sea-level 
peak of 6-9 m for this GIA-induced contamination yields a predicted 
ESL ~1m higher (that is, 7-10m), in accord with the residual 
highstand elevations at Bermuda and the Bahamas cited above (Sup- 
plementary Table 1). 

In the Supplementary Information we present results of sensitivity 
tests related to GIA predictions. First, we extend the duration of the 
MIS 11 hiatus in ice mass change from 9 to 14 kyr. In this case, the GIA- 
induced elevation of MIS 11 shorelines at sites within the Laurentian 
peripheral bulge are predicted to increase by ~1-2 m when viscosity 
model LM is adopted, yielding residual MIS 11 highstand elevations of 
7.0 + 1m for Bermuda and 9.9 + 3.6 m in Eleuthera (Supplementary 
Table 1), and thus lowering the inferred peak ESL during MIS 11 by 
~1-2m. Second, we considered an alternative viscosity profile, VM2 
(ref. 26), characterized by a lower-mantle viscosity that is a factor of 
two smaller than model LM. In this case, the predicted GIA contri- 
bution to MIS 11 highstand elevations within the peripheral bulge is 
reduced by a factor of ~2. This reduction yields residual (GIA- 
corrected) MIS 11 elevations of 15.0+1 m and 14.3+3.6m at 
Bermuda and the Bahamas, respectively, values that appear to be at 
odds with lower far-field estimates®. Last, we performed a Monte Carlo 
parameter search in which we varied mantle viscosity, lithospheric 
thickness, and the duration of the modelled hiatus in MIS 11 ice 
volume changes, and tested for the consistency of the GIA-corrected 
highstand elevations. This exercise yields a preferred bound on the 
peak ESL during MIS 11 of 6-13 m. 

In summary, observations of MIS 11 highstand features in Bermuda 
and the Bahamas must be corrected for the significant contaminating 
signal of GIA before these data are used to infer peak ESL during the 
MIS 11 interglacial. We conclude that ESL reached ~6-13 m above the 
present-day value in the late stages of MIS 11. It is unlikely that ocean 
thermal expansion contributed more than ~1 m to this inference’*”’. 
Therefore, although this estimate of peak ESL indicates significant 
collapse of both the WAIS and GIS during the MIS 11 interglacial, it 
rules out any significant melting of the EAIS during this unusually 
prolonged period of ice age warmth. 
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Adaptive radiation of multituberculate mammals 
before the extinction of dinosaurs 


Gregory P. Wilson’, Alistair R. Evans’, lan J. Corfe*, Peter D. Smits'?, Mikael Fortelius?* & Jukka Jernvall® 


The Cretaceous-Paleogene mass extinction approximately 66 million 
years ago is conventionally thought to have been a turning point in 
mammalian evolution’. Prior to that event and for the first two- 
thirds of their evolutionary history, mammals were mostly confined 
to roles as generalized, small-bodied, nocturnal insectivores’, 
presumably under selection pressures from dinosaurs*. Release 
from these pressures, by extinction of non-avian dinosaurs at the 
Cretaceous-Paleogene boundary, triggered ecological diversification 
of mammals'”. Although recent individual fossil discoveries have 
shown that some mammalian lineages diversified ecologically during 
the Mesozoic era’, comprehensive ecological analyses of mammalian 
groups crossing the Cretaceous—Paleogene boundary are lacking. 
Such analyses are needed because diversification analyses of living 
taxa°’ allow only indirect inferences of past ecosystems. Here we show 
that in arguably the most evolutionarily successful clade of Mesozoic 
mammals, the Multituberculata, an adaptive radiation began at least 
20 million years before the extinction of non-avian dinosaurs and 
continued across the Cretaceous-Paleogene boundary. Disparity in 
dental complexity, which relates to the range of diets, rose sharply in 
step with generic richness and disparity in body size. Moreover, 
maximum dental complexity and body size demonstrate an adaptive 
shift towards increased herbivory. This dietary expansion tracked 


Ctenacodon 


Ptilodus 


Occlusal 


the ecological rise of angiosperms® and suggests that the resources 
that were available to multituberculates were relatively unaffected 
by the Cretaceous—Paleogene mass extinction. Taken together, our 
results indicate that mammals were able to take advantage of new 
ecological opportunities in the Mesozoic and that at least some of 
these opportunities persisted through the Cretaceous-Paleogene 
mass extinction. Similar broad-scale ecomorphological inventories 
of other radiations may help to constrain the possible causes of mass 
extinctions”. 

Multituberculate mammals were a taxonomically rich*"’ and 
numerically abundant’’ clade that had originated by the Middle 
Jurassic epoch (approximately 165 million years (Myr) ago) and went 
extinct in the late Eocene (approximately 35 Myr ago)*''. They were 
nearly globally distributed’ and had a distinctive dentition consisting 
of procumbent incisors, blade-like premolars, molars with longitudinal 
rows of cusps (Fig. 1) anda predominantly posteriorly directed (palinal) 
chewing motion'*”. 

Palaeontologists have agreed for a long time that the success of 
multituberculate mammals was at least partly related to their highly 
derived dentition. Despite this, there is little consensus on the inter- 
pretations of their feeding ecology, perhaps owing to the limitations of 
previous approaches. For example, toothwear analysis is time intensive 
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Figure 1 | 
Dental and dietary diversity in multituberculate mammals. a—d, Three- 
dimensional buccal-occlusal and occlusal reconstructions of multituberculate 


lower-right cheek tooth rows for GIS analysis: Late Jurassic plagiaulacid 
Ctenacodon serratus (a), Paleocene ptilodontoid Ptilodus kummae (b), Late 


(84-66 Myr ago) 


(65-64 Myr ago) 
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Orientation 


Cretaceous cimolomyid Cimolomys gracilis (c), Paleocene taeniolabidid 
Taeniolabis taoensis (d). Surface orientation map of each three-dimensional 
reconstruction (colour wheel indicates orientation) for OPC measurements 
(shown by the number of coloured patches). Clumps that are smaller than three 
grid points (black) are ignored. p, premolar; m, molar. Scale bars, 1 mm. 
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and requires high-quality preservation’, and multituberculates lack 
living descendants and this hampers comparative studies. Inter- 
pretations of multituberculate feeding ecology therefore vary widely; 
they have been proposed to be broad herbivores, frugivores, granivores, 
root- and bark-eaters, egg-eaters, insectivores, carnivores and 
omnivores'*"”, 

To obtain a robust and comprehensive view of multituberculate 
ecomorphological diversity through time, we quantified dental com- 
plexity in 41 genera using geographic information systems (GIS) 
analyses'* of three-dimensional crown surfaces of lower cheek teeth 
(Fig. 1; Supplementary Table 1). These analyses do not require cusp 
and facet homologies to be established, which can be a challenging task 
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when comparing morphologically and phylogenetically divergent taxa. 
Orientation patch count (OPC), a measure of dental complexity, was 
calculated as the number of discrete surfaces on the cheek tooth row 
distinguished by differences in orientation (for example, north, 
southwest; Fig. 1). Extant rodents, carnivorans and bats demonstrate 
a robust correlation between OPC and feeding ecology; OPC increases 
across the dietary spectrum from carnivores to omnivores to herbivores, 
despite many differences in specific tooth components, body size 
and chewing mechanics among these taxa’*’’. For a given clade, the 
standard deviation of OPC is an effective proxy for the dietary diver- 
sification and divergence in feeding function: higher standard devi- 
ation means greater dental disparity and a broader range of diets. OPC 
thus offers promise as a powerful tool for quantifying overall tooth 
shape and inferring diet in extinct taxa, such as multituberculates, that 
have highly derived dentitions with uncertain homology with living 
mammals and imprecise functional analogy. 

Among the earliest multituberculates, the ‘Plagiaulacida’ are a 
paraphyletic assemblage of taxa with up to four simple blade-like 
premolars and two multi-cusped molars (Fig. 1a). OPC analyses of 
‘plagiaulacids’ from the Late Jurassic through to the Early Cretaceous 
epoch (from 156-100 Myr ago) show low and tightly constrained 
dental complexity (Fig. 2a; OPC, 84-125). Their OPC values corre- 
spond to carnivory and the low end of animal-dominated omnivory 
among extant mammals (for example, eating both insects and fruits). 
Multituberculates in the early Late Cretaceous (100-84 Myr ago), 
which include mostly basal members of the suborder Cimolodonta, 
had a slightly higher mean OPC than did the ‘plagiaulacids’ but 
retained the low standard deviation of OPC (Fig. 2a), indicating low 
morphological disparity. A distinct break occurred 84-66 Myr ago, in 
the latest Cretaceous, when the mean OPC rose and peaked within the 
Campanian (mean OPC, 145) and maximum OPC and disparity 
sharply increased as well (Fig. 2a; OPC, 70-230). Of the 17 taxa for 
this interval, 5 have OPC values that are greater than 160 and 2 have 
OPC values that are greater than 200, corresponding to values for 
plant-dominated omnivory and herbivory among extant mammals, 
respectively (Fig. 2a). 

Finally, in the early Paleocene (66-62 Myr ago) multituberculates 
maintained high OPC (mean OPC, 138) and disparity peaked 
(Fig. 2a; OPC, 70-348). The early Paleocene Taeniolabis from North 
America (Fig. 1d) had the highest OPC among multituberculates (OPC, 
348), which exceeded OPC values of extant herbivorous rodents and 
carnivorans'’. Of the 16 other early Paleocene taxa, 4 have OPC values 
greater than 160 and 1 has an OPC value greater than 200 (Fig. 2a). 
Mean OPC and disparity of OPC declined during the remainder of the 
Paleocene and in the early Eocene (62-49 Myr ago). Ectypodus, the 
only known genus from the middle to late Eocene (49-35 Myr ago), has 
alow OPC (109), corresponding to the high end of the range for extant 
mammalian carnivores. This sharp drop in dental ecomorphological 


Figure 2 | Temporal patterns of multituberculate dental complexity, 
taxonomic richness, body size and angiosperm ecological diversification. 
a, Dental complexity (measured by OPC) for 41 multituberculate genera (solid 
black lines) and estimated for 24 additional genera (dashed black lines; see 
Supplementary Information) with mean dental complexity (OPC; red line) and 
disparity as standard deviations of OPC (blue line) in 5-Myr bins. Lengths of 
horizontal lines represent temporal ranges of taxa or uncertainties in ages of 
fossil localities. Labels for dietary classes are positioned at the lower end of their 
range based on OPC values in modern mammals'*. Green shaded area 
represents plant-dominated omnivory (PD omni) and herbivory. 

b, Taxonomic richness is equal to the number of genera per 5-Myr bin (blue 
shaded area). c, Body-mass estimates for 156 multituberculate species (solid 
black lines) with geometric means (red line) and disparity (blue line) for each 
5-Myr bin. d, Angiosperm within-flora richness (solid green line) and relative 
abundance (dashed green line) from ref. 24 (+7.5-Myr moving averages). Leaf 
hydraulic capacity of angiosperms as leaf-vein density (green circles) from ref. 
25. The 136-131-Myr bin was excluded from all analyses (Supplementary 
Information). AD omni, animal-dominated omnivory; E, early; Hypercarn, 
hypercarnivory; M, middle; L, late; O, Oligocene; P, Paleocene. 
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disparity is intriguing in light of the hypothesis that some late 
Paleocene to early Eocene eutherian lineages (for example, rodents) 
competitively displaced multituberculates”. 

Variable sampling of fossils through time may influence our 
inferred patterns of multituberculate radiation. In particular, the rela- 
tively depauperate Jurassic and Early Cretaceous taxonomic richness 
of multituberculates may be partly due to the limited fossil record’. To 
test whether the changes in OPC through time might be sensitive to 
uneven sampling of fossils, we randomized the OPC value assignments 
for each genus. The results of 5,000 randomized mean OPC profiles 
through time indicate that even with additional fossil discoveries, the 
OPC patterns and ecological inferences that are presented here are 
likely to be robust (see Supplementary Fig. 8). 

The overall trend of increasing multituberculate dental complexity 
was driven by increases in the number of cusps per molar and the 
relative size of molars in the cheek tooth row (Supplementary Fig. 1). 
This evolutionary pathway differs from that taken by ungulates, 
primates, rodents and lagomorphs, which responded to increased 
mechanical processing demands (that is, greater plant component) 
by evolving multi-cusped, molar-like premolars in addition to 
complex molars”. Among several other mammalian lineages with 
blade-like premolars (for example, carpolestid primates and potoroid 
marsupials), only sthenurine marsupials re-evolved multi-cusped 
premolars”. Developmental, functional or structural constraints that 
are associated with the evolution of a blade-like premolar may have 
inhibited later evolution of more cuspidate premolars. 

Patterns of generic richness and conservative estimates of body size 
for multituberculates (see Methods Summary) mirror the pattern of 
initially low dental complexity for most of the Mesozoic followed by a 
marked increase just before the end of the Cretaceous (84-66 Myr ago; 
Fig. 2b, c). From the Late Jurassic to the early Late Cretaceous, 
multituberculate generic richness fluctuated between 4 and 16 genera, 
and body mass ranged between 9 g and 105 g. A marked shift occurred 
near the end of the Cretaceous, with generic richness increasing from 
16 to 21 genera and average body mass increasing from 67 g to 194 g, 
in particular, Bubodens magnus reached an estimated 5.25 kg (for a 
comparison, the alpine marmot weighs 3-8 kg). Similarly, Cenozoic 
generic richness and body size patterns parallel changes in dental 
complexity (Fig. 2b, c). However, it should be noted that body size 
and dental complexity are not interchangeable as predictors of diet 
because small multituberculates can have high OPC values, and large 
multituberculates can have low OPC values (Supplementary Informa- 
tion), conforming to previous analyses of extant taxa!®. Furthermore, 
variation in the amount of available fossil-bearing rock through the 
studied interval could contribute to the pattern of generic richness”. 

It is worth noting that increases in dental complexity (OPC of greater 
than 160) occurred in five multituberculate lineages: in the Asian 
Djadochtatheroidea, the North American Eucosmodontidae and 
Cimolomyidae (just before the end of the Cretaceous), the North 
American and Asian Taeniolabididae, and the North American 
Microcosmodontidae (in the early Paleocene). In the absence of a 
robust multituberculate phylogeny, it is unclear whether this repre- 
sents a single evolutionary increase in tooth complexity, with multiple 
reversals to low complexity, or parallel increases in individual lineages 
or clades. This increase differs from the more commonly observed 
pattern, in which increased morphologic disparity precedes taxonomic 
diversification”’, as this increase occurred in step with an increase in 
multituberculate body size and taxonomic richness. 

Overall, the pattern of increasing dental complexity in multi- 
tuberculate mammals that pre-dates the Cretaceous—Paleogene mass 
extinction event contrasts with conventional ideas that mammalian 
evolution was suppressed during the Mesozoic era by selective pressures 
imposed by dinosaurs. Instead, multituberculates, the mammalian clade 
that co-existed for the longest time with non-avian dinosaurs, initiated 
an evolutionary radiation during the acme of dinosaur diversity, 
approximately 20 Myr before the Cretaceous—Paleogene boundary”. 
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This is consistent with the highly specialized adaptations that are found 
among several new, exceptionally well-preserved Mesozoic mammal 
specimens’ and with the timing of increased molecular divergence 
rates among extant mammalian lineages®’, but the multituberculates 
also show broad taxonomic and ecomorphologic diversification at the 
intraordinal level. 

This adaptive response post-dated the taxonomic radiation of 
angiosperms but coincided broadly with increases in ecological diversity, 
abundance and leaf hydraulic capacities of angiosperms in the Late 
Cretaceous (in the Campanian and Maastrichtian)*’** (Fig. 2d), 
suggesting that there is a causal link. Many angiosperms during this 
time were herbaceous, and had a rapid life cycle and less-effective 
herbivore defences compared to other seed plants and would con- 
sequently have been an attractive, protein-rich food source for 
herbivores”’. Some angiosperms had begun inhabiting a broader range 
of niches (trees, herbs and epiphytes) than most other plant groups and 
this may have enabled greater partitioning of the herbivore niche. 
Therefore, the multituberculate adaptive radiation, suggested by the 
increasing dental complexity, may have been triggered mainly by new 
niche space that was generated through the evolutionary and ecological 
radiations of angiosperms, but may have also been influenced by 
parallel radiations of associated non-angiosperm clades (for example, 
ferns’’), insect pollinators, dispersers and herbivores”*. A trophic link 
between angiosperms and some multituberculates is supported by our 
inferred dietary trend towards increased plant-dominated omnivory 
and herbivory among multituberculates just before the end of the 
Cretaceous and in the early Paleocene. 

Our data also show that the dietary range of multituberculates did 
not decrease in response to the Cretaceous—Paleogene mass extinction 
event (Fig. 2a). Despite substantial taxonomic turnover of multi- 
tuberculates at the Cretaceous—Paleogene boundary”, they seem to 
have experienced little change in available food resources during one 
of the most severe extinction events in Earth’s history. This apparent 
indifference of the multituberculate radiation to the Cretaceous— 
Paleogene event underscores the ecological selectivity of extinctions 
and suggests that broad-scale ecomorphological inventories of radia- 
tions may help to constrain the possible causes of extinction for other 


groups at the Cretaceous—Paleogene boundary””®. 


METHODS SUMMARY 


We scanned 48 dentitions from 41 genera of multituberculates. The three- 
dimensional scans of lower cheek tooth rows were analysed using GIS to quantify 
the number of discrete orientation patches using eight orientation directions. The 
mean of repeated measurements for eight rotations at multiples of 5.625° was used 
to reduce the effect of slight variations in the orientation of teeth. We compiled 
generic richness data from recent compendia and the primary literature. We 
estimated body mass from skull length, whenever available, and a tooth size to 
skull length regression formula. Detailed methods and calculations of dental com- 
plexity, generic richness and estimated body mass, data sets, measurements, cal- 
culations, randomization analyses and additional references are provided in the 
Supplementary Information. 
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imperfect mimicry 
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Although exceptional examples of adaptation are frequently 
celebrated, some outcomes of natural selection seem far from perfect. 
For example, many hoverflies (Diptera: Syrphidae) are harmless 
(Batesian') mimics of stinging Hymenoptera’. However, although 
some hoverfly species are considered excellent mimics, other species 
bear only a superficial resemblance to their models’ and it is unclear 
why this is so. To evaluate hypotheses that have been put forward to 
explain interspecific variation in the mimetic fidelity of Palearctic 
Syrphidae we use a comparative approach. We show that the most 
plausible explanation is that predators impose less selection for 
mimetic fidelity on smaller hoverfly species because they are less 
profitable prey items. In particular, our findings, in combination 
with previous results, allow us to reject several key hypotheses for 
imperfect mimicry: first, human ratings of mimetic fidelity are 
positively correlated with both morphometric measures and avian 
rankings, indicating that variation in mimetic fidelity is not simply 
an illusion based on human perception’; second, no species of 
syrphid maps out in multidimensional space as being intermediate 
in appearance between several different hymenopteran model 
species, as the multimodel hypothesis” requires; and third, we find 
no evidence for a negative relationship between mimetic fidelity and 
abundance, which calls into question the kin-selection® hypothesis. 
By contrast, a strong positive relationship between mimetic fidelity 
and body size supports the relaxed-selection hypothesis”*, suggest- 
ing that reduced predation pressure on less profitable prey species 
limits the selection for mimetic perfection. 

Much of evolutionary theory relating to mimicry has been based on 
the assumption that the fidelity of mimicry in any given system is 
extremely high (reviewed in ref. 9). Of course, if mimicry were poor, 
one might intuitively expect that signal receivers would learn be able to 
distinguish mimics from their models and (depending on context) 
accept or reject them. In reality, however, there are many examples 
of inaccurate or low-fidelity mimics****"°. For example, the relative 
compositions of key odour compounds of specific non-rewarding 
orchids differ markedly from the bees they have evolved to resemble’’. 
Similarly, although mimetic spiders of the genus Cosmophasis bear 
some resemblance to ants, they are readily visually discriminated by 
the human observer"'. Perhaps the best examples of imperfect mimics 
are found in hoverflies (Diptera: Syrphidae), which are considered 
Batesian (harmless) mimics of hymenopteran models’ but seem to 
vary markedly across species in the degree of mimetic perfection*’” 
(see Supplementary Fig. 1). 

Numerous hypotheses have been proposed to explain the evolution 
and maintenance of ‘imperfect’ mimicry, but despite calls for study 
there have been no comparative tests of their validity®’®. Common 
(and interrelated) explanations include the following: first, “eye of 
the beholder’, such that poor mimics to human eyes remain good 
mimics to natural predators*; second, ‘multimodel’, such that mimics 
gain most benefit from imperfect similarity to multiple models’; third, 
kin selection, such that imperfect mimicry is maintained through its 
benefit to conspecifics carrying the same trait®; fourth, constraints such 


that some factor limits the evolution of more perfect mimicry* includ- 
ing trade-offs (for example mimetic perfection versus thermoregula- 
tion’* or camouflage’*), phylogenetic constraints on body plan, or lack 
of genetic variation; and fifth, relaxed selection, such that selection 
for mimicry gradually weakens to a point at which it is readily 
counteracted by weak selection or mutation’* (see Supplementary 
Information for additional hypotheses). Here we describe new com- 
parative morphological and genetic data that allow us to evaluate 
which of these hypotheses are best able to explain the range of mimetic 
perfection seen in hoverflies. 

A long-standing challenge in elucidating the ultimate causes of 
imperfect mimicry is that of quantifying the extent of mimetic fidelity 
between mimics and models, which frequently differ in multiple trait 
dimensions ranging from colour to shape. Previous measures of 
mimetic fidelity have used human rankings'®, pigeon responses'*””, 
pixel mapping’, neural networks’® and multivariate analyses’*. We 
employed subjective human rankings of mimetic fidelity (hereafter 
fr; see Methods) across a range of species that were compared for 
consistency against a measure derived from a multivariate analysis of 
trait values (hereafter fp; see Methods). 

Overall, our morphological analysis of 38 syrphid species and 
10 hymenopteran model species indicated that there was a clear 
statistical difference in appearance between the two taxa (Syrphidae 
and Hymenoptera) (nested multivariate analysis of variance: taxon, 
Fy 427 = 797.77, P< 0.001; species, F477, 427 = 11.03, P< 0.001). The 
first three canonical variates in a generalized canonical discriminant 
analysis (GCDA) explained 80.6% of the variance between the species 
in terms of their morphological features (individually: 41.1%, 20.4% 
and 19.2%). The Mahalanobis distances between each of the syrphid 
species and each of the Hymenoptera groups were calculated from 
these three canonical variates. This measure was multiplied by —1 to 
give a quantity, fp, that is positively related to mimetic fidelity. 
Syrphids and hymenopterans tend to cluster with their respective 
orders, largely on the basis of relative length of antennae (Lant), a 
feature that is thought to be used in discrimination by birds'®. The 
multimodel hypothesis’ would predict that mimics fall between several 
models, gaining greater benefit from multiple, weaker associations. 
Our analysis clearly indicates that there are no mimetic phenotypes 
in our sample that could be considered to be falling morphologically 
between two or more distinct model phenotypes (Fig. 1). 

The eye-of-the-beholder hypothesis recognizes that the natural 
predators of mimics (such as birds) and humans differ in both their 
perception and cognitive abilities, and argues that the apparent vari- 
ation in mimetic perfection is therefore illusory and/or misleading. 
However, our multivariate measure of mimetic fidelity correlated well 
with our human ranking (r= 0.555, d.fi=36, P<0.001; Fig. 2), 
whereas similar work indicates that trained pigeons rank mimetic 
fidelity of hoverfly species in much the same way as humans do 
(Supplementary Fig. 2). Additional results suggest that behavioural 
mimicry (antenna waving, mock stinging, wing wagging) occurs only 
in species that humans classify as high-fidelity mimics (H.D.P., C.H., 
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Figure 1 | GCDA plot of biometrical measurements. Circles are models 
(hymenopterans); triangles are mimics (syrphids); grey symbols are wasps or 
wasp mimics. Open symbols are bees or bee mimics, and the solid black circle is 


J.H.S. and T.N.S., unpublished observations), further suggesting that 
the human-based quantification of mimetic perfection is ecologically 
relevant. Taken together, these findings suggest that we can discount 
the eye-of-the-beholder hypothesis as an explanation for interspecific 
variation in hoverfly fidelity. 

It has been stated frequently that, within hoverflies at least, poor 
mimics tend to occur at higher population densities than good 
mimics**"*, Although this relationship is plausible, it remains entirely 
anecdotal. The kin-selection hypothesis for imperfect mimicry not only 
assumes a degree of family grouping (which is unlikely in hoverflies*) 
but also predicts that the evolved degree of mimetic perfection will be 
lowest when mimics are relatively common and/or relatively beneficial 
to attack in comparison with their models’. This prediction arises from 
the assumption that predators that are largely unable to distinguish 
mimics from models will sample those high-fidelity mimics at a greater 


rate when the incentive to attack is greater. The constraints'*'* and 
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Figure 2 | The relationship between Mahalanobis distances (fp) and 
human rankings (f;x) measures of mimetic fidelity in hoverflies. The line 
shows the fitted linear regression. Filled triangles, wasp mimics; open triangles, 
bee mimics; filled circle, the non-mimetic syrphid Cheilosia vernalis. 
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the non-mimetic syrphid Cheilosia vernalis. Species codes can be found in 
Supplementary Table 2; other abbreviations are described in Methods. 


relaxed-selection”*® hypotheses predict the opposite, namely that the 
evolved degree of perfection will be highest when mimics are relatively 
common and/or relatively beneficial to attack as a result of the 
increased selection pressure to avoid predation through mimicry. A 
meta-analysis of 11 independent studies (Supplementary Table 1) 
demonstrated no evidence of a strong correlation between relative 
abundance and either measure of mimetic fidelity in our 38 focal 
species, either before controlling for phylogenetic autocorrelation 
(furs T = 0.065 (95% confidence interval (CI), —0.052 to 0.181); fiw, 
F = 0.001 (95% CI, —0.149 to 0.152)) or after controlling for phylogeny 
by using phylogenetic generalized least-squares regression for a subset 
of 31 species (Fig. 3) (fir, FT = —0.083 (95% CI, —0.031 to 0.198); fim, 
F = 0.223 (95% CI, 0.058-0.389)). Note that this latter correlation 
between fp and abundance was significant (the 95% CI did not over- 
lap zero) and positive, indicating that in this case poor mimics tend to 
be less common. Therefore, on the basis of our sample of species (that 
is, those sufficiently common to appear regularly in systematic field 
surveys) we found no evidence that good mimics tend to be rarer 
(Supplementary Figs 3 and 4). 

The constraints'*'* and relaxed-selection”* hypotheses suggest that 
smaller-bodied (and thus less nutritionally profitable’’) species will 
endure low levels of predation, even if they are poor mimics, which 
produces weaker selection for improved mimicry; the kin-selection 
hypothesis predicts the opposite (see above). Therefore the relation- 
ship between the body size of hoverfly species and their mimetic fidelity 
was assessed. There was a highly significant relationship between body 
size (principal component 1; —PC1) and fp both before controlling 
for phylogeny (Pearson’s correlation, r = 0.680, d.f. = 36, P< 0.001) 
and after controlling for phylogeny (phylogenetic generalized least- 
squares regression, t = 4.693, P<0.001). The relationship between 
body size and fp was also highly significant before controlling for 
phylogeny (r = 0.632, d.f. = 36, P< 0.001) and after controlling for 
phylogeny (t = 3.005, P = 0.005); see Fig. 4. Finally, our meta-analysis 
indicates that the relative abundance of hoverfly species was only 
weakly (negatively) correlated with their body size before controlling 
for phylogeny (7 = —0.132 (95% CI, —0.239 to —0.024)) and after 
controlling for phylogeny (7 = —0.240 (95% CI, —0.476 to —0.005)), 
and it is clear that this relationship is not consistent across studies 
(Supplementary Fig. 5). 
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Figure 3 | Phylogenetic relationships between the 31 hoverfly species 
(Diptera: Syphidae) for which fidelity and body-size data exist. Shaded bars 
show the species’ mimetic fidelity (fj7x), and open bars show the species body 
size (—PC1). In both cases, the length of the bars is scaled from smallest to the 
largest score for the species shown. The model for each hoverfly species is 
shown at the right (W, wasp; B, bee). See the text for details of the tree 
construction methods, and Supplementary Fig. 6 for the phylogeny of all 81 
species for which the cytochrome oxidase c subunit I (COI) gene has been 
sequenced. 


The constraints and relaxed-selection hypotheses both suggest that 
larger, more profitable species will tend to achieve a higher degree of 
mimetic fidelity at equilibrium, as a result of the greater underlying 
incentive on predators to attack them. Other factors influencing 
profitability might include evasive flight behaviour, but predation of 
hoverflies by birds takes place largely on flowers’. Of course, the 
predicted evolutionary trajectory for small and large mimics is less 
obvious if larger-bodied species are rarer than smaller-bodied 
species'**°; however, as noted above, there is no consistent evidence 
that this is true. Similarly, our expectation that selection might be less 
intense on small species because they are less valuable prey might not 
hold if predators rely heavily on body size as a trait to distinguish 
mimics from models” (hymenopteran models tend to be larger). 
Thus, if predators were highly sensitive to difference in body size, then 
depending on conditions there might be a complete relaxation of selec- 
tion on mimicry in small species (because they are always attacked 
regardless of mimetic fidelity), leading to the same general outcome 
we have observed, or even stronger selection mimetic fidelity in small 
species as a way to counteract size-based discrimination. Nevertheless, 
there is evidence that predators cannot discriminate perfectly between 
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Figure 4 | Relationship between body size (estimated as —PC1) and two 
measures of mimetic fidelity. a, A measure of mimetic fidelity based on 
Mahalanobis distances (fp). b, Human ratings of mimetic fidelity (fizz). Lines 
show the fitted linear regressions. Filled triangles, wasp mimics; open triangles, 
bee mimics; filled circle, the non-mimetic syrphid Cheilosia vernalis. 


mimics and models on the basis of body size’, and so our profitability 
interpretation seems the most reasonable. 

Mimicry provides a textbook example of adaptation, but researchers 
have long debated why the fidelity of many imperfect mimics is not 
further improved by natural selection. Our study represents the first 
attempt to evaluate multiple hypotheses for imperfect mimicry in the 
group best known for it, and the first to reveal a significant pattern. Of 
the five primary hypotheses that we evaluated, our comparative study 
is only consistent with the constraints and relaxed-selection hypotheses 
while questioning the assumptions and predictions of the kin-selection, 
eye-of-the-beholder and multimodel hypotheses. Our revelation of a 
strong positive relationship between body size and mimetic fidelity is 
readily explained if body size influences predation behaviour and 
thereby the intensity of selection for more perfect mimicry. Indeed, 
relationships between body size and the evolution of aposematic 
coloration in dendrobatid frogs” and putative snake mimicry in 
Lepidoptera” may also be explained by the kinds of processes we 
have invoked. The fact that we can explain the variation in mimetic 
fidelity on the basis of a relaxation of selection on mimetic fidelity 
alone, without the need to invoke a specific constraint to generate 
counter-selection, renders the relaxed-selection hypothesis entirely 
sufficient to explain the variation we have documented, but we cannot 
discount the constraints hypothesis. Whether the patterns we have 
revealed are observed in other mimicry complexes remains to be seen. 
However, it is clear that the comparative method will be important in 
evaluating the various explanations not only for imperfect mimicry 
but for limits to adaptation in general. 
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METHODS SUMMARY 


We focused on 35 hoverfly species recorded in the most extensive multiannual 
abundance data set available** (see Supplementary Table 2) and three additional 
high-fidelity mimics. Phenotypic characters relevant to avian perception were 
measured from photographs of the 38 species. These characters included antenna 
length (Lant), abdomen length (L,»), abdomen width (W,,), thorax width (Wi), 
wing length (Lwing), all expressed relative to head width (Wyeaa). The mean red, 
green and blue colours (R, G and B) of the abdomen and associated standard 
deviations (sdz, sdg and sdg) were estimated. Finally, the number and colour of 
patches and/or stripes were recorded (Stripe, Patch, StripePatchCol). The 
Mahalanobis distances between species’ centroids when these multivariate data 
were plotted in multivariate space gave a measure of mimetic fidelity, fyyp, and a 
principal components analysis of linear dimensions gave a measure of body size 
(—PC1, because the first principal component was negatively correlated with size). 
A sample of the photographs, all dorsal view and of the same apparent size, 
representing two different individuals of each of the 38 syrphid species, were rated 
by human volunteers (m= 21) for similarity (from 1 (very poor mimic) to 10 
(excellent mimic)) to photographs of a wasp, honeybee and bumblebee. The highest 
average rating provided a second measure of mimetic fidelity, fur. 

We calculated the relative abundance of our species from 11 independent 
studies on the basis of a range of trapping and survey methods. The COI gene 
was sequenced for 77 species of 21 genera of Syrphidae, including 31 species for 
which we have morphological data, and phylogenetic trees were constructed by 
using Bayesian methods. Relationships between variables were evaluated with 
control for phylogenetic autocorrelation (using phylogenetic generalized least- 
squares regression) and without control for phylogenetic autocorrelation (using 
Pearson correlations). For the abundance analysis, weighted-mean correlations (7) 
and 95% confidence intervals were calculated across all 11 studies. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Mimetic fidelity. Specimens. We focused on the hoverfly species recorded in the 
most extensive (more than 40,000 specimens) multiannual (15 years) abundance 
data set available**; see Supplementary Table 2 for a species list. We took photo- 
graphs of the dorsal and lateral aspects of pinned specimens of 35 species that were 
included in this focal data set and present in sufficient numbers at the Canadian 
National Collection of Insects and Arachnids, Ottawa, Canada. Three additional 
hoverfly species were included to increase the number of high-fidelity mimics, and 
ten hymenopteran species representing a broad array of potential models (vespid 
wasps, polistine wasp, honeybee and bumble bees) were also photographed. 
Photographs were taken of ten individuals (five males, five females) for each of 
the 38 syrphid species, and ten individuals of each hymenopteran species (all 
female). All the photographs were taken with a Canon EO5 50D camera with 
macro lens (100 mm) and microtwin light (MT-243X). 

Human rankings. A sample of the photographs, all dorsal view, representing 
two different individuals of each of the 38 syrphid species were collated. Human 
volunteers (n = 21) were shown each photograph in random order on a projector 
screen for 20s, alongside the same images of a wasp (Vespula vulgaris), honeybee 
(Apis mellifera) and bumblebee (Bombus impatiens). Each hoverfly and model 
image was presented at magnifications such that they had the same projected body 
length. Human subjects were asked to rank each syrphid on a scale of 1 (very poor 
mimic) to 10 (excellent mimic) for each of the three potential models (wasp, 
honeybee and bumble bee). The human rank of mimetic fidelity identified the 
model type to which the potential mimic bore the closest resemblance (based on 
overall mean score for images of that species) and provided a measure of mimetic 
fidelity, fur. 

Multivariate ratings. A range of attributes were extracted from individual 
photographs of specimens of the syrphids and hymenopterans described above, 
using ImageJ (http://imagej.nih.gov/ij/). Specimen phenotypic characters were 
selected on the basis of their relevance to avian perception (taken from ref. 16). 
Body dimensions comprised antenna length (Lay,), abdomen length (L,,), 
abdomen width (W,»), thorax width (Win) and wing length (Lying), all expressed 
relative to head width (Wy.aq). Mean red, green and blue colours (R, G and B) and 
the standard deviation of RGB (sdg, sdg, sdg) of the abdomen were also measured 
with Corel PhotoPaint X3. Finally, the number and colour (classified by the 
observer as white, grey, silver, yellow, orange or brown) of patches and/or stripes 
were also recorded (Stripe, Patch, StripePatchCol). Note that our RGB estimates 
were unstandardized and we did not transform our measurements into avian (or 
any other potential predator’s) colour space. However, our estimates do capture 
(albeit coarsely) some of the variation in colour that is likely to be perceived by 
humans, birds and insects. Moreover, we note that the relationship we presented 
between human and multivariate measures of mimetic fidelity (see the text) 
remains significant (albeit weaker) even when we remove RGB and the number 
of stripes and patches, retaining only measurements of the physical dimensions of 
specimens (r = 0.369, d.f. = 36, P = 0.023). 

AGCDA was used to identify combinations of variables that serve to discriminate 
between one or more groups of data on the basis of differences between them using 
the candisc library” in R (ref. 26). Each potential mimic species is represented by a 
centroid in multivariate space representing the mean GCDA variates of the indi- 
vidual specimens of that species. Typically, each model species centroid clustered 
closely around its taxonomic group centroid. The distance between species’ 
centroids (the Mahalanobis distance) represents a multidimensional measure of 
the morphological similarity between species. Although we cannot confidently 
assign specific hymenopteran models to each mimic, it is possible to distinguish 
bee mimics from wasp mimics. Therefore, the average of the distances between 
each mimic centroid and the centroids of the bee and wasp models was calculated 
to give a score for mimetic fidelity for each mimic to its putative model group. 
The smallest mean Mahalanobis distance from the hoverfly species to its 
potential model type (bee or wasp) again provided a measure of mimetic fidelity. 
Because Mahalanobis distances were negatively related to mimetic fidelity, we 
invert those distances to give a measure of mimetic fidelity that is easier to 
interpret: fuup. 

Quantification of body size. The body size of each of the 38 syrphid species 
photographed was quantified by conducting a principal components analysis on 
body dimensions (expressed in absolute terms) mentioned above. The first 
principal component (PC1) explained 82.9% of the variation in the data, was 
strongly negatively correlated with all six body dimensions (r<—0.81, 
P<0.001 in all cases), and was taken as a composite measure of body size. 
Because PC1 was negatively correlated to body size measures, we refer to 
—PC1, which is positively correlated with body size, in the results as an aid to 
interpretation. 

Abundance. We identified 11 independent studies that provide estimates of rela- 
tive hoverfly abundance based on a range of trapping and survey methods (see 
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Supplementary Table 1 for full details). These studies included between 3 and 35 
species for which we had detailed morphological data (hence estimates of mimetic 
fidelity). Where multiple years of data were presented, this was based on the 
arithmetic mean count of each species trapped per year over the whole period 
of recording. These data were considered reasonably indicative of abundance, as 
there was a general tendency for the relative abundance of hoverfly species to 
correlate between studies based on a range of trapping methods (see Supplemen- 
tary Table 3). 

Phylogenetic analysis. A total of 77 species of 21 genera of Syrphidae are included 
in the ingroup (Supplementary Table 4), including 31 species for which we have 
morphological data (Fig. 3 and Supplementary Table 2). To encompass a range of 
genetic variation, we included multiple exemplars from each genus used in our 
mimicry analysis. These specimens represent most major syrphid clades. Four 
species of Microdon, the putative sister group to the rest of the syrphids*”**, were 
used as outgroup taxa. 

The 5’ region of the gene encoding cytochrome oxidase c subunit I (COI) was 
sequenced for each specimen in accordance with the methods outlined in ref. 29. 
DNA extraction and sequencing were performed in house and at the Canadian 
Centre for DNA Barcoding. The resultant sequences, as well as images and related 
data, can be accessed through the Barcode of Life Data Systems (BOLD) (http:// 
www.barcodinglife.org/) in the public project “Mimicry—Skevington (MIMSK)’ 
(http://www.boldsystems.org/views/projectmenu.php?&). In addition, all sequences 
were deposited in GenBank (Supplementary Table 4). 

Bayesian analyses were conducted with MrBayes 3.1.2 (ref. 30) with a Markov 
Chain Monte Carlo (MCMC) method as submitted remotely to the CIPRES 
computing cluster (http://www.phylo.org/)*?. MrModelltest v2.3 (ref. 32) was used 
to determine the best model (GTR+1+G) for analysis. Four chains (three hot, one 
cold) were run simultaneously for 5 million generations. Trees were sampled every 
1,000 generations and each simulation was run twice. At 5 million generations the 
standard deviation was 0.03195. After the discard of the first 500,000 samples as 
burn-in, 9,002 trees were used to generate a majority-rule consensus tree, posterior 
probabilities for each node, and branch length estimates. The resulting phylogeny 
is largely congruent with other published results that used nuclear loci*” and 
contains multiple taxa per genus, which acts as a major control for the single 
mitochondrial marker. Reassuringly, the resulting phylogeny supported 
monophyly for all genera apart from identifying two paraphyletic genera that have 
been suggested as such in the literature (Cheilosia**** and Eupeodes™; Supplemen- 
tary Fig. 6). Finally, using additional 28S sequences for a subset of 15 species (see 
Supplementary Table 4 for accession numbers), we find a significant correlation 
between the phylogenetic distances of the COI+ 28S and COI-only trees (Mantel 
test for phylogenetic distances, r = 0.680, P< 0.0001), suggesting that our COI 
phylogeny is an adequate representation of the phylogenetic relationships between 
the species in our analysis. 

Relationships between variables were first evaluated without control for 
phylogenetic autocorrelation by using Pearson correlations. Autocorrelation was 
incorporated into a second analysis by using the corGrafen function in the ape 
library’? in R (ref. 26) to create a covariance matrix based on species from the tree 
structure described above. The gls function in the nlme package*® was then used 
to perform the test. For the abundance analysis, Pearson correlations 
between variables were used in the MetaTable function in the psychometric 
package” in R to calculate the weighted-mean correlation, 7, for all 11 studies. 
This statistic does not have an associated P value, but 95% confidence intervals for 
the coefficient were calculated and can be used to determine significance. To 
control for phylogenetic autocorrelation, f-statistics from the generalized least- 
squares regression analyses of abundance were converted to Pearson correlation 
coefficients, where r = t/(n — 2 + f°), followed by the calculation of the weighted- 
mean correlation. 
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MEGF10 and MEGF11 mediate homotypic interactions 
required for mosaic spacing of retinal neurons 


Jeremy N. Kay', Monica W. Chu' & Joshua R. Sanes! 


In many parts of the nervous system, neuronal somata display 
orderly spatial arrangements’. In the retina, neurons of numerous 
individual subtypes form regular arrays called mosaics: they are 
less likely to be near neighbours of the same subtype than would 
occur by chance, resulting in ‘exclusion zones’ that separate 
them’~*. Mosaic arrangements provide a mechanism to distribute 
each cell type evenly across the retina, ensuring that all parts of the 
visual field have access to a full set of processing elements’. 
Remarkably, mosaics are independent of each other: although a 
neuron of one subtype is unlikely to be adjacent to another of 
the same subtype, there is no restriction on its spatial relationship 
to neighbouring neurons of other subtypes’. This independence 
has led to the hypothesis that molecular cues expressed by specific 
subtypes pattern mosaics by mediating homotypic (within-subtype) 
short-range repulsive interactions'*°. So far, however, no molecules 
have been identified that show such activity, so this hypothesis 
remains untested. Here we demonstrate in mouse that two related 
transmembrane proteins, MEGF10 and MEGF11, have critical roles 
in the formation of mosaics by two retinal interneuron subtypes, 
starburst amacrine cells and horizontal cells. MEGF10 and 11 
and their invertebrate relatives Caenorhabditis elegans CED-1 
and Drosophila Draper have hitherto been studied primarily as 
receptors necessary for engulfment of debris following apoptosis 
or axonal injury'’ *. Our results demonstrate that members of this 
gene family can also serve as subtype-specific ligands that pattern 
neuronal arrays. 

The retina contains over 70 neuronal subtypes, divided into broad 
categories of photoreceptors, interneurons and retinal ganglion cells’. 
To seek molecules involved in cell-cell recognition events during 
retinal circuit assembly, we purified 13 subtypes of retinal neurons 
from transgenic mice and used microarrays to inventory the genes 
they expressed'*’”. We collected cells at postnatal day (P)6, a time at 
which synapse formation and mosaic refinement are underway. From 
this data set we identified genes selectively expressed by specific sub- 
types, including starburst amacrine cells (SACs), an interneuronal 
subtype that has critical roles in motion detection’®. The approxi- 
mately 100 genes that met our criteria for selective SAC expression 
included most known SAC markers as well as potential new markers 
(Fig. la and Supplementary Table 1). Thirty-one of the novel genes 
were tested by in situ hybridization or immunohistochemistry in com- 
bination with SAC markers such as choline acetyltransferase (ChAT) 
or calbindin’’. This secondary screen yielded 26 genes (Supplementary 
Table 1), among which were two homologues, Megf10 and Megf11. 
These genes encode transmembrane proteins with multiple epidermal 
growth factor (EGF)-like domains, a single membrane-spanning 
region, and a cytoplasmic domain with several binding sites for signal 
transduction components'*”’ (Supplementary Fig. 1). At P5-6, both 
genes were strongly expressed in SACs (Fig. la-c). Both were also 
expressed in horizontal cells (HCs), which were not part of the data set. 

SACs are present in both the inner nuclear and ganglion cell layers, 
and form independent mosaics in each°. These mosaics develop during 
late embryonic stages, as newborn SACs migrate from the site of their 


birth, the outer neuroblast layer, to their final laminar locations. SACs 
begin to exhibit mosaic spacing upon arrival at their destinations, 
presumably due to contacts with their homotypic neighbours*’. As 


a 
= 
2 
no 
3 
io 
x 
oO 
oO 
= 
& 
oO 
a 
SG © GCG © YF GO A HK Ww MDM OC LO ww 
PP ORM RM GP MD eG WM LW SH ww ow WM 
io) ge y & SC ‘S) > 
c 
° Megf10 Megf10 Calb Megf10 


Mol INL 


d Megf10 Niss| f 


Islet 1 


oS ‘ 
ie 
fo} 
lu 
= 
& 
= 
) 
oO 
i 
o} 
wu 
= 


Figure 1 | Expression of Megf10 and Megf11 in SACs and HCs. a, Relative 
expression level of Megf10, Megf11 and the known SAC markers Chat, Calb1 
(encoding calbindin) and Js/1 (islet 1) in 13 amacrine (green), bipolar (blue) and 
retinal ganglion cell (black) subtypes analysed with microarrays. Level in SACs 
set to 1 for each gene. Isl was also detected in ON bipolar cells as previously 
reported”’. Abbreviations for subtypes are defined in Supplementary Table 2. 
b, Schematic of retina. INL, inner nuclear layer with horizontal, bipolar, 
amacrine, and Miiller glial cells; IPL, inner plexiform layer with synapses 
among bipolar, amacrine and ganglion cells; GCL, ganglion cell layer including 
displaced amacrine cells; ONL, outer nuclear layer containing photoreceptors; 
OPL, outer plexiform layer with photoreceptor synapses. SACs (red), HCs 
(green) and Miiller cells (dark grey row) are indicated. ¢, In situ hybridization 
for Megf10 (left panel; red in right panel) combined with anti-calbindin 
immunohistochemistry (green in right panel) at P5 shows expression in SACs 
(black arrows) and HCs (arrowheads). d, Megf10 (red) in E16 retina. Islet 1 
(green) marks SACs migrating through the outer neuroblast layer (ONbL) and 
in the INL’. Megf10 (red) is expressed by migrating SACs as they arrive in the 
INL (arrowhead), but not at earlier stages of their migration through the ONbL 
(arrows). e, Megf10 expression appears in Miiller glia (Mii) and is lost from 
SACs and HCs by P14. f, Double-label immunostaining for MEGF10 (green) 
and ChAT (red). MEGF10 protein localizes to the somata and processes of 
developing SACs (also see Supplementary Fig. 1). Scale bars, 20 [1m (c-e) or 
10 ttm (f). 
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new SACs are added to the array, local cellular rearrangements maintain 
mosaic spacing'*’. Mosaic spacing is maintained even as SAC dendrites 
grow to overlap with those of their neighbours”; thus, mosaicism is 
distinct from the phenomenon called tiling, which minimizes dendritic 
overlap’. Co-staining at embryonic day (E)16 with the early SAC 
marker islet 1 (refs 9, 21) demonstrated that newborn SACs activated 
Megf10 expression as they finished their migration and became inte- 
grated into mosaics (Fig. 1d). Megf10 expression persisted in SACs 
from E16 to the first postnatal week, and began in HCs at PO (Sup- 
plementary Fig. 1 and data not shown). In the second postnatal week, 
Megf10 was downregulated in neurons but appeared in Miiller glia 
(Fig. le), consistent with previous reports that Megf10 is expressed 
by brain glia’. To determine the subcellular localization of MEGF10 
in neurons, we generated an antibody to recombinant protein (Sup- 
plementary Fig. 1). MEGF10 was present both on the somata of SACs 
and HCs and on their processes (Fig. 1f and Supplementary Fig. 1). As 
expected, immunoreactivity levels on SACs and HCs were highest 
during the first postnatal week, and then declined (data not shown). 
Thus, Megf10 is expressed by SACs and MEGF 10 protein is present on 
SAC processes during the time that mosaics form. Because Megf11 
expression was not observed in retina until after SAC mosaics had 
formed (see later), we focused first on Megf10. 

To investigate whether MEGF10 is required for SAC development, 
we generated mutant mice (Supplementary Fig. 2). Megfl0 mutants 
were viable and fertile and their retinas exhibited no gross abnormalities. 
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Mutant SACs migrated to the inner nuclear and ganglion cell layers as in 
controls, they were present in normal numbers, and their dendrites 
projected to appropriate sublaminae of the inner plexiform layer 
(Fig. 2a—-c). Examination of whole mounts showed, however, a marked 
loss of regular spacing among SAC somata, suggesting that their mosaic 
arrangement had been disrupted (Fig. 2a and Supplementary Fig. 3). 
To assess the degree to which SAC mosaics were disrupted, we 
measured the exclusion zone—the region surrounding each SAC in 
which other SACs are rarely found. This parameter is calculated from 
the density recovery profile, a plot of cell density as a function of 
distance from each SAC in the array”* (Fig. 2d). The SAC exclusion 
zone in mutants was smaller than in wild-type littermates, and was 
approximately equal to the diameter of a SAC soma (Fig. 2e). As the 
only limitation on proximity was soma size, SACs seem to be posi- 
tioned randomly in Megf10 mutants. This conclusion was supported 
by two additional measurements of spatial order: the packing factor”, 
another index of regularity calculated from the density recovery pro- 
file; and an independent measure, the Voronoi domain regularity 
index, which quantifies variations in the area of the territories nearest 
to each cell in an array*”**. In each case the index calculated for SAC 
arrays in Megf10 mutants was similar to that measured for computer- 
generated random arrays, whereas SAC arrays from wild-type littermates 
were highly ordered (Fig. 2f, g); heterozygotes showed a mild disorgan- 
ization (Supplementary Fig. 4). Together these results suggest that 
MEGF10 acts in SACs to impose a minimal intercellular spacing; in 
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Figure 2 | Loss of SAC mosaic spacing in Megf10 mutant mice. a, SAC 
mosaic in inner nuclear layer (INL) of wild-type (left) and Megf10~ (centre) 
retina, revealed by whole-mount staining with anti-ChAT. Wild-type mice 
have evenly spaced SAC somata, whereas mutants exhibit clumps and gaps 
similar to those seen in a simulation of a random cellular array (right). See 
Supplementary Fig. 3 for similar results in ganglion cell layer (GCL) SACs. 

b, ChAT-stained retinal sections from wild-type and Megf10 mutant animals. 
Laminar positions of SAC somata and processes are normal in mutants, even in 
regions where somata are clumped. c, Density of SACs and VGLUT3* 
amacrine cells is similar in wild-type (+/+) and Megf10 mutant (-/-) retina. 
d, Density recovery profiles (DRPs) for the SAC (INL) and VGLUT3* 
amacrine arrays. Graphs show the density of cells ina ring of radius x, relative to 
the density of cells in the image as a whole. Dashed line, DRP of random point 
array. The exclusion zone characteristic of mosaic spacing is measured as a dip 
below this line. e, Exclusion zone radius measured from d. Dashed line, 
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expected result for an array of cells distributed randomly; that is, the diameter of 
a single cell. Increases above this minimum indicate spatial order. The mutant 
SAC exclusion zone radius was similar in size to a SAC cell diameter, and was 
significantly smaller than wild type (*P < 0.0001). VGLUT3~ amacrine 
exclusion zones were unaffected. f, g, SAC packing factor (f) and Voronoi 
domain regularity index (g) were significantly lower in Megf10~ mice than in 
wild-type littermates (*P < 0.0001). Dashed line, mean for arrays of cells 
distributed randomly. Wild-type SACs and VGLUT3 © cells of both genotypes 
were non-randomly arrayed. The SAC array in mutants was not significantly 
different from random arrays (f, P = 0.16; g, P = 0.48). h, Morphology of single 
GCL SACs, labelled with adeno-associated virus driving membrane-targeted 
Cherry fluorescent protein, showed no gross abnormalities in Megf10 mutants 
(n= 8 cells each genotype). Data from P15 (a—g) or P80 (h) mice. Scale bars, 
50 um. Error bars, s.e.m. 
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its absence SACs assume random positions relative to each other. By 
contrast, other amacrine cell types examined (VGLUT3- (also known 
as SLC17A8) and tyrosine hydroxylase-positive) as well as bistratified 
direction-selective retinal ganglion cells, which are prominent synaptic 
targets of SACs'*, showed normal mosaic spacing (Fig. 2d-g and 
Supplementary Fig. 3). 

In mice lacking the adhesion molecules DSCAM or DSCAMLI, 
certain retinal cell mosaics form normally, then degrade secondary 
to hyperfasciculation of their neurites*’°. To investigate whether 
MEGF10 acts in a similar indirect fashion, we labelled individual 
SACs in mutants. Lack of MEGF10 had no obvious effect on SAC 
dendritic morphology (Fig. 2h). Moreover, soma disorganization 
was already evident by PO (Supplementary Fig. 5), shortly after mosaics 
form, indicating that MEGF10 affected formation of SAC mosaics. 

Next we asked whether MEGF10 is also essential for formation of 
the HC mosaic. Loss of MEGF10 had only a modest effect on HC 
regularity (Fig. 3). We therefore considered that MEGF11 might have 
a redundant role. Megf11 was not expressed in embryonic retina, but it 
appeared in HCs and SACs during the first postnatal week and per- 
sisted into adulthood (Fig. 3a—d). Importantly, SAC mosaics , to 
form before P0, but HC mosaics are established postnatally**’. W 
therefore generated Megfl1 mutant mice (Supplementary Fig. 2), 
which, like Megfl10 mutants, showed no gross retinal abnormalities 
(Supplementary Fig. 6). Examination of HC arrays in Megf11 mutants 
revealed a modest decrease in regularity similar to that in Megf10 
mutants, whereas in Megf1 ov; Megfl1— double-mutant animals, 
the HC mosaic was severely disrupted (Fig. 3e-m and Supplemen- 
tary Fig. 7). By contrast, SACs were unaffected by loss of Megfl1 
function, and were no more affected in double mutants than in 
Megf10 single mutants (Supplementary Figs 6 and 8). Thus, Megf11 
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is dispensable for SAC arrangement, but acts together with Megf10 to 
shape the HC mosaic. 

To elucidate the cellular mechanisms by which Megf10 and Megf11 
act, we used a gain-of-function approach in which we introduced 
MEGF10 into the retina by electroporation of plasmid DNA at PO. 
We first tested the hypothesis that MEGF10 can act as a signal that 
repels SACs, creating the exclusion zone that defines mosaic spacing. 
The electroporation method predominantly transfects dividing cells, 
leading to expression in neurons that exit the cell cycle postnatally, 
such as bipolar cells, Miiller glia, photoreceptors and late-born amacrine 
cells’. Because SACs and HCs are born embryonically, they are rarely 
transduced”, allowing us to surround wild-type SACs and HCs with 
cells ectopically expressing excess MEGF10. Indeed, electroporation of 
plasmid encoding a fluorescent protein (FP) or a MEGF10-FP fusion 
produced retinal patches in which a large fraction of neurons, but no 
SACs or HCs, were FP* (Fig. 4a and Supplementary Fig. 9). Expression 
of MEGF10-FP (but not FP alone) resulted in exclusion of SACs and 
HCs from a swath at the edge of the electroporated patch, whereas 
spacing of these cells was essentially normal in patch centres (Fig. 4a, 
b and Supplementary Fig. 10). A truncated MEGF10 lacking the 
cytoplasmic domain produced an identical phenotype (Supplemen- 
tary Fig. 10), ruling out the possibility that MEGF10 acts indirectly 
by triggering production of a repellent factor by the transfected cells. 
Thus, MEGF10 can act as a ligand that signals to SACs and HCs. 

Because MEGF10 relatives have been implicated in cell engulfment'?”’, 
we asked whether its overexpression formed an exclusion zone by 
eliminating cells. We found no evidence for cell death at patch edges 
(no pyknotic nuclei or activated caspase-3 immunoreactivity; data not 
shown). Instead, cell density was increased at the outer edge of the 
cell-free swath, indicating that SACs had exited the patch to create the 
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Figure 3 | HC mosaic spacing requires Megf10 and Megf11. a-d, In situ 
hybridization for Megf11 at ages indicated. Megf11 (red) was not expressed at 
E16 (a). Calbindin immunostaining (green) labels SACs and HCs, which co- 
express Megf11 at P5 (b), P7 (c) and P14 (d). See Fig. 1 for abbreviations. 
e-i, Retinal whole mounts stained for calbindin to reveal the HC array. In wild- 
type mice (e), HCs are distributed evenly. Megf10~~ mutants (f) and Megf11~~ 
mutants (g) show subtle changes in the regular spacing of HCs, whereas double 
Megf10”-; Megf11- mutants (h) show marked HC disorganization similar to a 
simulation of a random HC array (i). j-m, Quantification of HC spacing 
regularity in Megf10’~ (red) or Megfll ‘~ (green) single mutants; Megfl0~ ; 


Megf11 ~~ double mutants (blue); and wild-type (WT) siblings (grey). In all 
genotypes, HCs were present at normal density (j) but were less regularly 
spaced relative to wild type based on exclusion zone radius (k), packing factor 
(1) and Voronoi domain regularity (m) measurements as in Fig. 2. Double 
mutants showed significantly less order than single Megfl0”” or Megfl 1” 
mutants and approach random arrangement, indicated by dashed lines 

(k, mean HC soma diameter; 1, m, computed values for random arrays). P 
values: *P < 0.01, **P < 0.001, ***P < 0.0001. NS, not significant. Error bars 
give s.e.m. Data in e-m from P15 animals. Scale bars, 20 um (a-d; b, c share 
scale) or 50 um (e-i). 
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Figure 4 | MEGF10 acts as both ligand and receptor to trigger SAC 
repulsion. a, A retinal patch transfected by electroporation with plasmid 
encoding MEGF10-FP fusion protein, viewed in flat mount. SAC somata in 
INL, stained with anti-ChAT, are excluded from a swath at the patch edge. 
SACs are evenly spaced elsewhere, except where the retina was pierced to inject 
DNA (Inj. site). b, Higher-magnification views of patch edges. FP 
misexpression (left) did not affect SAC spacing, but MEGF10-FP (right) 
produced a SAC-free zone just inside the transfected region and induced 
apparent crowding of SACs immediately outside it (arrows). Dashed line, patch 
edge. See Supplementary Fig. 11 for quantification of cell distribution at patch 
edges. c, Hypothesis for MEGF10 function based on a, b. In wild-type retina 


SAC-free zone (Fig. 4b and Supplementary Fig. 11). Our interpretation 
is that SACs and HCs are repelled by MEGF10, but that within 
MEGF10-FP patches the concentration of MEGF10 is similar in all 
directions, resulting in no net movement (Fig. 4c). Consistent with this 
view, SACs and HCs were entirely excluded from small patches (=5-6 
cell diameters; Supplementary Fig. 10). Together these results suggest 
that MEGF10 can act as a repellent ligand. 

We next asked which cells are sensitive to MEGF10 ligand. In 
Megf10 mutants, SACs and HCs are selectively affected; this might 
be because they are uniquely sensitive to MEGF10 or because only 
these cells encounter endogenous MEGF10 at high concentration. 
Ectopic expression allowed us to distinguish between these possibilities. 
None of 13 other amacrine, bipolar or retinal ganglion cell subtypes 
assayed with cell-type-specific markers was detectably affected by 
MEGF10-FP (Fig. 4d and Supplementary Table 3). Thus, MEGF10 
appears to act asa cell-surface ligand fora receptor specifically expressed 
by SACs and HCs. 

This led us to ask whether MEGF10 might be the MEGF10 receptor. 
We used cultured epithelial cells (HEK293) to seek evidence for a 
MEGF10-dependent homotypic interaction. Whereas FP-expressing 
cells overlapped, MEGF10-FP-expressing cells formed sharp borders 
with narrow gaps (Supplementary Fig. 12a), consistent with previous 
results’. This ‘jigsaw’ pattern reflected suppression or elimination of 
filopodia at points of intercellular apposition (Supplementary Fig. 12b). 
Jigsaw formation and loss of filopodia required the MEGF10 cytoplasmic 
domain and did not occur when a MEGF10-FP-expressing cell con- 
tacted an untransfected or FP-transfected cell (Supplementary Fig. 
12b, c). Although we have been unable to demonstrate MEGF10 
homophilic binding using biochemical methods (data not shown), 
these results suggest that a MEGF10-containing signalling complex 
mediates a homotypic interaction resulting in intercellular repulsion. 

Finally, we tested the idea that MEGF10 serves as both ligand and 
receptor in vivo. To this end, we electroporated MEGF10 into Megf10 
mutant retina. Mutant SACs did not exit MEGF10-FP patches (Fig. 4e 
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(top), SACs use MEGF10 as a ligand to signal their location to neighbouring 
SACs. Cell 3 positions itself at the point where repulsive signals on either side 
(from Cell 1 and Cell 2) are equal. In MEGF10-FP overexpression (bottom), 
that location is now outside the patch. d, MEGF10-FP transfected whole- 
mount immunostained for ChAT and VGLUTS3. Dashed lines mark SAC-free 
zone (see Supplementary Fig. 10). VGLUT3* amacrine cells are present in this 
zone. e, MEGF10-FP is incapable of generating a SAC-free zone when 
misexpressed in Megf10 mutant retina, indicating that MEGF10 is needed for 
SAC responses to MEGF10. See Supplementary Fig. 11 for quantification. Scale 
bars, 100 um (a); 50 um (b-e). All retinas are from P10-P15 animals. 


and Supplementary Fig. 11b), indicating that Megf10 gene function is 
required for SACs to respond to MEGF10 repulsive signals. Together 
these results suggest a model in which SACs and HCs use MEGF10 as 
part of a receptor complex that detects MEGF10 on their homotypic 
neighbours. This repulsive signal positions their somata so as to equalize 
MEGF10 signals on all sides, thereby creating exclusion zones (Fig. 4c). 

The phenomenon of retinal mosaicism implies a molecular system 
for cell-type-specific recognition. Several potential mechanisms have 
been proposed, based on imaging and computational studies, one being 
that short-range repulsive signals regulate tangential movements to 
establish each cell’s territory’**°. So far, however, direct mediators of 
this phenomenon have not been described. Here we provide evidence 
that, for SACs and HCs, mosaic spacing requires repellent homotypic 
interactions mediated by MEGF10 and 11. Signals initiated by 
MEGF10/11 in growing neurites could lead to repositioning of the 
soma within the cytoplasm, perhaps by mechanisms resembling those 
that direct soma translocation in response to signals at the leading 
process of migrating neurons”. The finding that MEGF10 and 11 
facilitate formation of three independent mosaics (two for SACs and 
one for HCs) appears at first to contradict the idea that mosaics are 
independent of each other and must therefore be regulated by distinct 
molecules’*°. However, because the three mosaics occupy distinct 
cellular planes, they may be exempt from the requirement for molecular 
individuation. Different molecules are likely to mediate homotypic 
interactions in other retinal subtypes, some of which may lead to soma 
translocation and others to the death of cells that violate minimal 
spacing”. 

We note two broader implications of our results. First, Draper/CED-1/ 
MEGF10 homologues have until now been studied predominantly as 
receptors for cell engulfment’?*. Here we show that they also mediate 
cell-cell repulsion and can act as ligands as well as receptors, thereby 
expanding the roles for this gene family. Second, although mosaic 
arrangements have so far been studied formally only in retina, regularly 
arranged neuronal arrays are common features of central neural 
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organization’”’. Mechanisms similar to those described here could be 


involved in promoting this regularity. 


METHODS SUMMARY 


Retinal neurons expressing fluorescent proteins were purified and used to generate 
amplified RNA for hybridizing Affymetrix microarrays as described previ- 
ously'®*’’, SAC-specific genes were identified using dChip software. Megf10 and 
Megf11 mutant mice were produced from constructs generated by the Knockout 
Mouse Project and European Conditional Mouse Mutagenesis program 
(EUCOMM)”. Histological methods'*'” and methods for electroporation of 
plasmid DNA in vivo'’~’ were described previously. HEK293 cells (ATCC) were 
cultured and transfected by standard methods. 

For analysis of spatial statistics, we sampled a 635.9 lum square at 3-4 locations 
per retina. X-Y cell coordinates, marked manually, were used to calculate Voronoi 
domain areas (Fiji) or DRP statistics such as the effective radius (that is, exclusion 
zone) and packing factor (WinDRP). We generated random arrays matched in 
density and soma size to real data; these were analysed in parallel with data from 
mutants. For measurement of SAC crowding in gain-of-function experiments, the 
Delaunay triangulation® was used to define each cell’s nearest neighbours. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Animals. CD1 and C57BL/6 mice were obtained from Charles River and Jackson 
Labs. All experiments were carried out in accordance with protocols approved by 
the Harvard University Standing Committee on the Use of Animals in Research 
and Teaching. 

Megf10 mutant mice were produced from a construct provided by the Knockout 
Mouse Project*? (CHORI). Embryonic stem (ES) cell electroporation and 
chimaera production were performed by the Harvard Genome Modification 
Facility. ES cell clones were screened for integration by PCR using primers listed 
in Supplementary Table 4 (also see Supplementary Fig. 2). Megf11 mutant mice 
were generated from gene-targeted ES cells provided by the European Conditional 
Mouse Mutagenesis program (EUCOMM). In both cases germline transmission 
was obtained from two chimaeric mice generated from independent ES cell clones; 
each had indistinguishable phenotypes. The primers used for genotyping Megf10 
and Megf11 mice are listed in Supplementary Table 4. Drd4-GFP mice*! were 
obtained from the Mutant Mouse Regional Resource Center. 

Cell sorting and expression profiling. Retinal neurons expressing fluorescent 
proteins were purified as described'*””. Briefly, P6 retinas were dissociated, live- 
stained with antibodies recognizing cell surface antigens (if required for purifica- 
tion), and then passed through a flow cytometer (Mo Flo; Dako). Positive cells 
were then either: (1) plated, fixed, stained with cell-type-specific markers and 
counted to assess purity; or (2) sorted directly into RNA lysis buffer (PicoPure 
Kit; MDS). Gene expression was profiled using Affymetrix Mouse 430 2.0 arrays, 
following two rounds of linear amplification (MessageAmp II; Applied 
Biosystems). Using these methods we generated a gene-expression database for 
13 specific retinal neuron subtypes (five amacrine, two bipolar and six retinal 
ganglion cell types; see Supplementary Table 2). SAC-specific genes (Supplemen- 
tary Table 1) were identified using dChip software using fold-change over other 
cell types as filtering criteria. 

Histology and staining. Fixation and preparation of retinal cryosections was 
performed as described'’. For whole-mount retinal stains, the retina was dissected 
out of the eyecup in a dish of phosphate buffered saline (PBS), transferred to an 
Eppendorf tube (up to four retinas per tube), and incubated at room temperature 
(20-25 °C) for 1h with agitation in blocking solution (PBS + 0.3% Triton-X-100 
+ 3% donkey serum; Jackson Immunoresearch). Primary antibody, diluted in 
blocking solution, was then applied to the sample; incubation was for 6 days at 
4°C with agitation. Following a 2-3h wash in 2-3 changes of PBS, secondary 
antibodies (Jackson Immunoresearch) were applied overnight at 4 °C with agita- 
tion. After at least 4h of washing in PBS with agitation, retinas were flattened onto 
nitrocellulose membranes (Millipore) and mounted on slides (Fluoromount G; 
Southern Biotech). Images were acquired with a FV1000 confocal microscope 
(Olympus). 

The following antibodies were used for immunostaining: rabbit anti-calbindin 
(Swant); goat anti-ChAT (Millipore); goat anti-VAChT (Promega); guinea pig 
anti-VGLUT3 (Millipore); mouse anti-islet 1/2 (clone 39.4D5, Developmental 
Studies Hybridoma Bank (DSHB)); mouse anti-SYT2 (DSHB); chicken anti-green 
fluorescent protein (GFP; Aves labs); rabbit anti-DsRed (Clontech); rabbit anti- 
B-galactosidase’; mouse anti-glutamine synthetase (BD); sheep anti-tyrosine 
hydroxylase (Millipore); mouse anti-PKC (Abcam); rabbit anti-DAB1 (Millipore); 
rabbit anti-CART (Phoenix); mouse anti-Kv4.2 (Neuromabs); rabbit anti-SOX9 
(Millipore); rabbit anti-cleaved-caspase 3 (Cell Signaling Technology); rabbit 
anti-bNOS (Sigma); rabbit anti-EBF3 (ref. 17); rabbit anti-GAD65/67 (Millipore); 
goat anti-GLYT1 (Millipore); mouse anti-BRN3A (Millipore); mouse SMI-32 
(Sternberger). MEGF10 antibody was raised in rabbits against a His-tagged peptide 
corresponding to amino acids 879-1130 of mouse MEGF10 (Millipore). 

In situ hybridization was performed according to previously reported methods”. 
To make the template for the Megf10 antisense probe, we obtained an IMAGE clone 
(BC075647) containing full-length mouse Megf10 and linearized with ClaI. Megf11 
template, in pGEM-T Easy (Promega), was generated by PCR from brain cDNA 
using primers TGTCTTCCTTCTGCAAGCTGCTCT and ATTCCACAAGT 
GCCTGGTGAGTGT. Antisense riboprobes were transcribed using the DIG 
RNA labelling kit (Roche). Calbindin immunostaining was used to label SACs 
and HCs—at PO-P7 it is a selective marker for these two cell types, showing 
additional expression in only a small subset of amacrine and retinal ganglion cells 
(Fig. 1 and Supplementary Fig. 9, and data not shown). 

Labelling of single SACs and HCs was performed by intraocular injection of 
low-titre adeno-associated virus, as described*®. 

Quantification of mosaic regularity. For analysis of mutant phenotypes, confocal 
z-stacks through the GCL and INL were acquired from P14-17 retinas stained with 
antibodies to ChAT, calbindin, tyrosine hydroxylase, and/or VGLUT3. Each retina 
was sampled in 3-4 locations (sample size was a square, 635.9 [1m on each side). 
Stacks in which mounting artefacts introduced large local z-axis displacement of the 
SAC or HC array were not used. We sampled from both central and peripheral 


retina; despite differences in cell density between these locations the spatial 
organization of SAC, HC and VGLUT3 arrays did not appear to differ systematically. 
We therefore pooled data from central and peripheral retina for subsequent analysis. 
Using Fiji software, the centre of each cell was marked manually to generate X-Y 
coordinates. The point array was flattened to a single plane without correction for 
local curvature, but as we only used very flat images for analysis, the spatial dis- 
tortions introduced by this procedure were minor. Density recovery profiles 
(DRPs) were computed from these X-Y coordinates using WinDRP software™*’. 
Voronoi domain areas were computed in Fiji and Microsoft Excel from the X-Y 
coordinate data. SAC, HC and VGLUT3* amacrine cell density and cell diameters 
were determined from these same images using Fiji to perform measurements. 

We calculated three measures of regularity for each image. First, from the DRP, 
we obtained the effective radius. Effective radius gives the size of the exclusion 
zone—the zone in which another cell is less likely to be found than would be 
expected for a random array”’. Second (also from the DRP), we computed the 
packing factor—a regularity index that ranges between 0 (for a random array) 
and 1 (for a perfect hexagonal array). Third, we calculated the Voronoi domain 
regularity index (VDRI) by dividing the mean Voronoi domain area for a given 
cellular array by the standard deviation of those areas”’**”». 

To ascertain how the measured packing factor and VDRI compared to those 
that would be observed in cell arrays lacking mosaic spacing, we generated 
random simulations of HC, SAC, and VGLUT3* amacrine cell arrays. These 
simulations placed cells randomly in a 635.9 um square following a Poisson point 
process, until the density of the array equalled the mean density of the cell type in 
question. The only constraint on cell location in these simulations was soma 
diameter, which was calculated for each cell type as described earlier. The soma 
diameters used to constrain the simulations were 9.0 um (VGLUT3* amacrine 
cells), 11.0 um (SACs) and 12.2 4m (HCs). Programita software*® was used to 
generate the simulations (n = 10 for each cell type). Packing factor and VDRI 
were then calculated as described above, and the means were plotted as dashed 
lines in Figs 2 and 3. The VDRIs for our random simulations were similar to those 
calculated previously”. 

For analysis of SAC cell position in MEGF10-misexpressing retina, we acquired 
images at the edge of FP” or MEGF10-FP* misexpressing patches, and used the 
above method to obtain X-Y coordinates of all SACs in the field of view. SACs 
inside the patch, outside the patch, and at the edge of the patch were marked 
separately. ‘Edge’ cells were defined as those outside the FP* region for which the 
shortest line drawn from that cell to the edge of the FP" region did not pass the 
soma of another SAC. To ask whether SACs were present at higher density at patch 
edges, we calculated the distance from each SAC to its nearest neighbours. 
Neighbours were defined in an unbiased manner by computing the Delaunay 
triangulation for the X-Y location data set (Fiji), thereby defining line segments 
from each cell to its nearest neighbours. 

Statistical analysis. For analysis of exclusion zone radius, packing factor, VDRI 
and cell density, the significance of measured differences between genotypes was 
evaluated by the Mann-Whitney U test. Sample sizes were =9 images from =3 
retinas per genotype (SAC and HC analysis) or 6 images from 2 retinas per genotype 
(VGLUT3 analysis). In double-mutant experiments on HCs, the Holm-Bonferroni 
correction for multiple comparisons (four different genotypes) was applied when 
determining significance level. For measurement of soma diameter, sample size 
was = 150 cells for each genotype. 

Gain-of-function experiments. A Megf10 expression construct was generated by 
PCR amplifying the open reading frame (ORF), with stop codon deleted, from the 
IMAGE clone described earlier. The open reading frame, which was predicted to 
encode a protein equivalent to the Megf10 RefSeq sequence (NP_115822.1), was 
TA-cloned into the Gateway entry vector pCR8GW-Topo (Invitrogen). Primers 
used for cloning were CGATTGTTCTTCACAGAACATGGCG and TTGAT 
GTGATTCACTGCTGCT. A cytoplasmic domain-deletion construct was made 
by amplifying with reverse primer TGATTCCTTCCTCTTCTGCTT to generate a 
truncated protein carrying only the first nine amino acids of the intracellular 
domain. For expression, these constructs were transferred to a Gateway destina- 
tion vector bearing the ubiquitin-C promoter and an in-frame C-terminal GFP or 
monomeric Cherry tag. Mouse Megf11 was also cloned, but pilot experiments in 
HEK cells suggested a lack of surface expression, so we did not attempt in vivo 
experiments with mouse Megf11. 

In vivo electroporation was performed as described'’”’. Briefly, plasmid DNA 
(at least 1.5 mg ml ') was injected into the subretinal space of neonatal mice (4— 
36h postpartum), and current pulses (80 V) were applied across the head using 
paddle electrodes. We obtained identical results in MEGF10-GFP and MEGF10- 
Cherry misexpression experiments. Sample sizes were >20 animals each for 
MEGF10 misexpression and FP controls. 

HEK 293 cells were cultured in DMEM with 10% fetal calf serum and trans- 
fected using TransIT reagent (Mirus). Cells were counterstained using Alexa 
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Control of ground-state pluripotency by allelic 


regulation of Nanog 


Yusuke Miyanari’ & Maria-Elena Torres-Padilla' 


Pluripotency is established through genome-wide reprogramming 
during mammalian pre-implantation development, resulting in the 
formation of the naive epiblast. Reprogramming involves both the 
resetting of epigenetic marks and the activation of pluripotent-cell- 
specific genes such as Nanog and Oct4 (also known as Pou5f1)'~*. 
The tight regulation of these genes is crucial for reprogramming, 
but the mechanisms that regulate their expression in vivo have not 
been uncovered. Here we show that Nanog—but not Oct4—is 
monoallelically expressed in early pre-implantation embryos. 
Nanog then undergoes a progressive switch to biallelic expression 
during the transition towards ground-state pluripotency in the 
naive epiblast of the late blastocyst. Embryonic stem (ES) cells 
grown in leukaemia inhibitory factor (LIF) and serum express 
Nanog mainly monoallelically and show asynchronous replication 
of the Nanog locus, a feature of monoallelically expressed genes’, 
but ES cells activate both alleles when cultured under 2i conditions, 
which mimic the pluripotent ground state in vitro. Live-cell imaging 
with reporter ES cells confirmed the allelic expression of Nanog and 
revealed allelic switching. The allelic expression of Nanog is regu- 
lated through the fibroblast growth factor-extracellular signal- 
regulated kinase signalling pathway, and it is accompanied by 
chromatin changes at the proximal promoter but occurs indepen- 
dently of DNA methylation. Nanog-heterozygous blastocysts 
have fewer inner-cell-mass derivatives and delayed primitive endo- 
derm formation, indicating a role for the biallelic expression of 
Nanog in the timely maturation of the inner cell mass into a fully 
reprogrammed pluripotent epiblast. We suggest that the tight 
regulation of Nanog dose at the chromosome level is necessary for 
the acquisition of ground-state pluripotency during development. 
Our data highlight an unexpected role for allelic expression in 
controlling the dose of pluripotency factors in vivo, adding an extra 
level to the regulation of reprogramming. 

The development of the naive epiblast in the late blastocyst is 
orchestrated by pluripotency-associated transcription factors such as 
NANOG and OCT4. OCT4 is required for establishing and maintain- 
ing the pluripotent state’, and NANOG is essential for the acquisition 
of pluripotency”®. In ES cells, the pluripotent state is also governed by 
these core transcription factors. Nanog occupies a central position in 
this network, but the mechanisms regulating its expression are unclear. 

To address how pluripotency-associated factors are regulated during 
reprogramming in vivo, we assayed the de novo gene expression of 
Nanog and Oct4 in mouse pre-implantation embryos, by using RNA 
fluorescence in situ hybridization (RNA-FISH), which allows nascent 
transcripts to be visualized. We found that Oct4 is actively expressed in 
a small proportion of 2-cell-stage embryos, but most blastomeres 
express Oct4 at the 4- and 8-cell stage (Fig. 1a, b). We detected active 
Nanog expression in about half of all 4- and 8-cell-stage blastomeres, 
which is consistent with previous reports of heterogeneous Nanog 
expression’. Surprisingly, although all of the other genes that we ana- 
lysed were expressed biallelically, Nanog invariably showed monoallelic 
expression in 2-, 4- and 8-cell-stage embryos (Fig. la, b and Sup- 
plementary Movie 1). We next addressed whether the monoallelic 


expression of Nanog is random or imprinted, by using allele-specific, 
single-cell PCR with reverse transcription (RT-PCR) on 8-cell- 
stage embryos from C57BL/6J < Mus musculus castaneus crosses. 
Consistent with our RNA-FISH results, most blastomeres expressed 
Nanog monoallelically (Fig. 1c and Supplementary Fig. 1). We observed 
no bias in Nanog expression from either the C57BL/6J or the M. musculus 
castaneus allele (Fig. 1c). We conclude that Nanog expression is 
monoallelic and random during early pre-implantation development. 

Nanog expression becomes restricted to the inner cell mass (ICM) of 
the blastocyst'”, where it is needed for the naive epiblast to attain 
pluripotency’. As expected, the proportion of cells expressing Nanog 
and Oct4 declined gradually as the expression of these genes became 
restricted to inner cells (Fig. 2a). Remarkably, the proportion of 
biallelism in Nanog-expressing cells increased progressively from 2% 
at the 8-cell stage to 70% of the inner cells in the late blastocyst (Fig. 2b 
and Supplementary Movie 2). This switch to biallelism of Nanog is in 
contrast to the consistent biallelic expression of Oct4 and Actb 
(Fig. 2a). Sequential RNA-FISH and DNA-FISH confirmed the 
biallelic expression of Nanog (Supplementary Fig. 1). The ICM of 
the late blastocyst contains two lineages: the extra-embryonic 
primitive endoderm, and the ‘ground-state’ pluripotent epiblast®*, 
which gives rise to the embryo. Inner cells expressing Nanog 
biallelically also express Oct4 but not Gata4, a primitive endoderm 
marker’, and therefore are epiblast cells (Fig. 2c). This finding suggests 
that Nanog is predominantly expressed from both alleles in the 
naive epiblast. Furthermore, Nanog expression gradually reversed to 
monoallelic expression after implantation (Supplementary Fig. 2), in 
line with the downregulation of Nanog and concomitant loss of 
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Figure 1 | Nanog expression is monoallelic in early embryos. a, RNA-FISH 
for Nanog and Oct4 in 4- and 8-cell-stage nuclei. Scale bar, 2 um. b, Proportion 
of allelic expression for each gene in 2-cell-stage (2C), 4-cell-stage (4C) and 
8-cell-stage (8C) blastomeres. n, number of cells analysed. c, Summary of 
single-cell RT-PCR data for Nanog in 8-cell-stage blastomeres, according to 
biallelic, monoallelic (C57BL/6] or M. musculus castaneus (Castaneus)) and no 
expression (none). 


lMnstitut de Génétique et de Biologie Moléculaire et Cellulaire, CNRS/INSERM U964, Université de Strasbourg, F-67404 Illkirch, Cité Universitaire de Strasbourg, France. 


470 | NATURE | VOL 483 | 22 MARCH 2012 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


@ RNA-FISH of embryos B Monoallelism b B Monoallelism © RNA-FISH of inner cells at LBL stage 

S @ Biallelism = & Biallelism 

E 499 Nanog Oct4 Actb x = 

2 > 100 

= ¢0 2 

2 gs 80 

ES = 

w 60 4 60 

u 2: 

< 40 a 40 

Zz o 

fr £ 20 

= 20 x 

= é 9 

s 2 o0Aaa 

oO Ode OVOdR OOVdg esa 

O ee€gnsteolsnsteeags < All Inner c 
n: 99 237 284528554 99 237284528554 65 109 172.425 523 n: 49 60 141 


e 
RNA-FISH of ES cells in LIF RNA-FISH of ES cells in LIF 
HMonoallelism i Biallelism 
= 100 
‘< 80 
2 
i 60 
Z 40 
3 
2 20 
2 
< oo 


N: 200 200 112 102 105 104 108 98 117112 


Figure 2 | Nanog shows biallelic expression in the naive, pluripotent 
epiblast. a, RNA-FISH for Nanog, Oct4 and Actb in 8-cell-stage (8C), 16-cell- 
stage (16C) and 32-cell-stage (32C) embryos (before cavitation), and early 
blastocysts (EBLs; E3.5) and late blastocysts (LBLs; E4.25). Data are shown as 
the percentage of Nanog-expressing cells (or Oct4- or Actb-expressing cells) 
relative to all cells analysed. b, Quantification of cells showing monoallelic or 
biallelic Nanog expression in Nanog-expressing 8-cell-stage blastomeres and 
inner cells at the 32-cell, EBL and LBL stages. *, P = 8.8 X 10°; 
**,P=3.1X 10% ***, P<1.0X 10 (Fisher’s exact test). c, RNA-FISH in 


pluripotency*. Thus, Nanog expression switches transiently from 
monoallelic to biallelic during the formation of the pluripotent epiblast 
in vivo, coinciding with the completion of the reprogramming process. 

We next investigated the allelic expression of Nanog in ES cells. 
When cultured in medium containing serum and LIF, 60-70% of ES 
cells express Nanog (refs 10, 11) (Supplementary Fig. 4). Under these 
conditions, Nanog is predominantly expressed from a single allele 
(Fig. 2d, e and Supplementary Movie 3). By contrast, all of the other 
genes analysed showed biallelic expression, including other pluripotency- 
associated genes (Oct4, Sox2, KIf2 and Fgf4), heterogeneously expressed 
genes (Pecam1, Bmp4 and Rex] (also known as Zfp42)) and housekeep- 
ing genes (Actb and Gapdh) (Fig. 2d, e). Allele-specific, single-cell RT- 
PCR with hybrid ES cells showed that Nanog expression is independent 
of parental origin, similarly to the early embryo (Supplementary Fig. 3). 
Thus, Nanog shows random monoallelic expression both in early pre- 
implantation embryos and in ES cells. 

The proportion of monoallelic versus biallelic Nanog expression in 
the naive epiblast is significantly different from that in ES cells, the 
latter showing much lower biallelism (Fig. 2b, e). We hypothesized that 
this could reflect the more homogeneous expression of Nanog in the 
naive epiblast compared to ES cells**. Ground-state pluripotency 
can be established in vitro through pharmacological inhibition (with 
two inhibitors (2i)) of MEK and GSK3B”. ES cells cultured with 2i 
show increased levels of Nanog messenger RNA°’* and homogeneous 
distribution of NANOG protein’ (Supplementary Fig. 4). Strikingly, 
we found that culturing ES cells in 2i resulted in a significant increase 
in biallelic Nanog expression (Fig. 2f, g, Supplementary Fig. 3 and 
Supplementary Movie 4). We confirmed Nanog biallelic expression 
with sequential RNA-FISH and DNA-FISH (data not shown). To 
address which signalling pathway(s) regulates the allelic expression 
of Nanog, we used combinations of pharmacological inhibitors®’*"*. 
We found that blocking fibroblast growth factor (FGF)-extracellular 
regulated kinase (ERK) signalling was sufficient to trigger the biallelic 
expression of Nanog (Supplementary Fig. 4). The naive epiblast and ES 
cells are thought to be maintained in a ground state by preventing the 
differentiation that is induced through FGF-ERK signalling*’”. Thus, 
collectively these results suggest that ground-state pluripotency might 
be achieved through the activation of the second allele of Nanog. 
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ICM cells of the LBL. The dashed line delineates individual ICM cells. d, ES cells 
cultured with serum and LIF display primarily monoallelic Nanog expression. 
A representative RNA-FISH image of ES cells is shown. e, Distribution of 
monoallelism and biallelism of Nanog and the indicated genes in ES cells 
cultured with LIF. f, A representative RNA-FISH image of nuclei from ES cells 
cultured with 2i/LIF, showing biallelic expression of Nanog. g, Treatment of ES 
cells with 2i/LIF leads to increased biallelic Nanog expression. The proportion 
of allelic expression of Nanog, Oct4, and Actb is shown for ES cells cultured with 
LIF or 2i/LIF. a, b, g, n, number of cells analysed. ¢, d, f, Scale bars, 2 |1m. 


The above changes in Nanog allelic expression upon pharma- 
cological treatment suggest that the Nanog alleles are dynamically 
regulated. To visualize the dynamics of activation of the two Nanog 
alleles in individual cells, we generated an ES-cell reporter in which a 
gene encoding destabilized Turbo green fluorescent protein 
(TurboGFP) was inserted immediately downstream of the NANOG- 
coding region in one Nanog allele and a gene encoding destabilized 
mCherry was inserted similarly into the other Nanog allele (Fig. 3a and 
Supplementary Fig. 5). Both fluorescent proteins dissociate from 
NANOG by self-cleavage of a 2A peptide and do not alter NANOG 
function. We observed heterogeneous distribution of TurboGFP or 
mCherry in ES cells grown in LIF, confirming that the monoallelic 
expression of Nanog is random (Fig. 3b). This is in contrast to the 
homogeneous distribution in ES cells grown in 2i, in which most cells 
expressed both fluorescent proteins, reflecting biallelic expression of 
Nanog (Fig. 3b). Using time-lapse analysis, we observed dynamic fluc- 
tuation of Nanog expression, in agreement with previous reports’?! 
(Supplementary Fig. 6 and Supplementary Movie 5). Moreover, we 
found that Nanog expression can switch between alleles. In most cases, 
the allelic switch occurred through an intermediate state of biallelic 
Nanog expression or no Nanog expression over several cell cycles. Ina 
minority of cases (2%), rapid allelic switching occurred during a single 
cell cycle (Fig. 3c and Supplementary Fig. 6). This switching is in 
contrast to other monoallelic genes, including imprinted genes, for 
which the inactive status of an allele is stably maintained”. 

Given the potential importance of activation of the second Nanog 
allele in establishing ground-state pluripotency, we next addressed the 
chromatin signatures underlying biallelic activation. We asked whether 
the allelic expression of Nanog obeys any epigenetic feature. We found 
that Nanog replicated asymmetrically in ES cells cultured with LIF 
alone—a feature of monoallelically expressed genes—but changed its 
replication pattern towards symmetric replication in ES cells cultured 
with 2i and LIF (denoted 2i/LIF) (Fig. 3d, e). This pattern is in contrast 
to the invariable asymmetric replication of imprinted genes (Snurf and 
H19; Fig. 3d, e) and is consistent with changes in allelic Nanog expres- 
sion. Replication timing is a distinctive epigenetic fingerprint”’*’”, thus 
suggesting that the allelic regulation of Nanog could be associated with 
epigenetic signatures. Indeed, monoallelic expression is accompanied 
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Figure 3 | The switch to biallelic expression of Nanog is accompanied by 
changes in replication timing and increased recruitment of mediator and 
cohesin. a, Schematic of the Nanog knock-in reporter NGR. A PEST motif in 
the carboxy terminus of the fluorescent proteins allows monitoring of dynamic 
Nanog expression. iHyg, internal ribosome entry site (IRES) hygromycin; iNeo, 
IRES neomycin; mChe, mCherry; NLS, nuclear localization signal; tGFP, 
TurboGFP. b, Representative image of NGR ES cells cultured with LIF or 2i/ 
LIF. Scale bar, 10 um. ¢, The incidence of allelic switching of Nanog expression 
in ES cells. Cells were classified into four groups: monoallelic (TurboGFP- 
positive, green), monoallelic (mCherry-positive, red), biallelic (TurboGFP- and 
mCherry-positive, yellow) and no expression (black). The proportion of cells 
undergoing a transition between these four groups during a single cell cycle is 
indicated. Overall, 47% of cells showed a colour change in this period. n, 
number of cells analysed. d, The asymmetric replication of Nanog in ES cells 
cultured with LIF changes to symmetric replication upon treatment with 2i. 


15,18 


by allele-specific DNA methylation and/or histone modifications 
However, we found that ES cells lacking all three DNA methyltrans- 
ferases’? had similar Nanog allelic expression to wild-type ES cells 
(Fig. 3f), indicating that the allelic regulation of Nanog is independent 
of DNA methylation, unlike that of imprinted genes". 

Next, we assessed allelic histone modifications on Nanog by using 
chromatin immunoprecipitation (ChIP), taking advantage of the fact 
that ES cells grown in LIF express Nanog mainly in a monoallelic manner 
and those grown in 2i/LIF primarily have two active alleles. ChIP showed 
a significantly higher enrichment of H3K4me3, a modification of active 
chromatin, at the transcription start site (TSS) of Nanogin ES cells grown 
in 2i/LIF than those grown in LIF alone (Fig. 3g). This increase is in line 
with the higher biallelic expression in 2i/LIF than LIF alone. The 
H3K4me3 enrichment at the Nanog TSS was approximately twofold 
in 2i/LIF, possibly suggesting that H3K4me3 is enriched at the active 
Nanog allele but absent at the inactive one in ES cells expressing Nanog 
monoallelically. We observed significantly higher recruitment of the 
mediator subunit MED12, as well as the cohesin-loading factor 
NIPBL, to the Nanog enhancer in ES cells grown in 2i/LIF than in ES 
cells grown in LIF alone (Fig. 3g). This increased recruitment was accom- 
panied by increased RNA polymerase II initiation and elongation upon 
2i treatment (Supplementary Fig. 7). Indeed, changes in chromatin 
architecture mediated through simultaneous binding of mediator and 
cohesin to the Nanog locus facilitate looping between the enhancer and 
promoter, resulting in Nanog gene activation”. Collectively, our data 
suggest that Nanog allelic expression might be actively regulated through 
chromosomal changes over the Nanog locus. 

Mice that are heterozygous for Nanog (Nanog‘'~) are born at 
normal Mendelian ratios’. We hypothesized that if Nanog biallelic 
expression is important for the formation of the ICM and/or for the 
acquisition of ground-state pluripotency, then Nanog*’~ embryos 
may show developmental defects at the onset of Nanog biallelic 
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expression. To address this, we counted ICM cells in individual 
Nanog*'~ blastocysts by using an anti-OCT4 antibody. Freshly collected 
embryonic day 3.5 (E3.5) Nanog*'~ and Nanog*’* blastocysts had 
similar ICM cell numbers (Supplementary Fig. 8a). This finding is in 
agreement with previous reports®*’ and with our observations 
documenting Nanog monoallelic expression before this embryonic 
stage. By contrast, we found significantly fewer ICM cells in 
Nanog*'~ blastocysts as the embryo progressed through development 
to form the nascent epiblast (Fig. 4a), which also showed increased 
apoptosis in the ICM (Supplementary Fig. 8b). No significant differ- 
ence in the number of NANOG-positive cells was observed between 
wild-type (Nanog’’*) and heterozygous (Nanog*’) embryos 
(Fig. 4b, c). However, Nanog*’~ embryos showed delayed primitive 
endoderm formation (Fig. 4b, c). Indeed, primitive endoderm forma- 
tion depends on a functional epiblast in a non-cell-autonomous 
manner”, suggesting that the epiblast in Nanog*’~ embryos is func- 
tionally impaired. It should be noted that the reduction in primitive 
endoderm cells does not fatally compromise development, owing to 
the regulatory capacity of the early post-implantation embryo. Thus, 
we conclude that the biallelic expression of Nanog is necessary for the 
timely maturation of the ICM into a functional epiblast and the 
accompanying primitive endoderm. 

NANOG has a dose-sensitive action in reprogramming®. Over- 
expression of NANOG is sufficient to prevent ES-cell differentiation’; 
conversely, Nanog*! ~ ES cells show spontaneous differentiation”, 
suggesting that the tight regulation of Nanog levels is crucial for both 
reprogramming and differentiation. We found that this dose regu- 
lation occurs principally at the allelic level. Monoallelic expression is 
predominant during the early reprogramming phase in cleavage stages, 
but it switches to biallelic expression as the ICM matures into the 
pluripotent epiblast (Fig. 4d). Monoallelic Nanog expression in ES cells 
may be representative of the late ICM or the early post-implantation 
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Figure 4 | Biallelic expression of Nanog is required for the timely 
maturation of the ICM derivatives. a, Reduced proportion of ICM derivatives 
in Nanog''~ blastocysts compared to Nanog‘’* blastocysts. Mid and late 
blastocysts were staged according to their total cell number. Data are expressed 
as the percentage of ICM cells per embryo. The box plot represents the 
ninetieth, seventy-fifth, fiftieth (median), twenty-fifth and tenth percentiles, as 
well as the mean (black dot). b, Delayed primitive endoderm (PE) formation in 
Nanog*’~ blastocysts. Embryos were staged in groups according to their total 
cell number. The number of PE (GAT A4-positive) and epiblast (EPI; NANOG- 
positive) cells was determined. c, Representative maximal confocal projections 
of late blastocysts, showing a reduced proportion of ICM derivatives in 
Nanog*'~ embryos. Scale bar, 10 pm. d, Model for the allelic regulation of 
Nanog and the acquisition of ground-state pluripotency. Nanog is expressed 
monoallelically during the cleavage stages and up to the early blastocyst. 
Biallelic expression of Nanog accompanies the formation of the fully 
reprogrammed, pluripotent, epiblast. Biallelic expression of Nanog also occurs 
in vitro under conditions that mimic the ground-state pluripotency of the naive 
epiblast (ES cells cultured with 2i/LIF). 


epiblast, in which there is a switch back to monoallelic expression. The 
expression of Nanog can switch between alleles, unlike most known 
monoallelic genes, raising the possibility that the regulation of Nanog 
allelic expression could be unique. Our evidence that such regulation is 
independent of DNA methylation and does not seem to involve typical 
repressive modifications (H3K27me3, H3K9mel, H3K9me2 and 
H3K9me3; data not shown) also supports this possibility. The 
ground-state pluripotency is facilitated by the suppression of FGF- 
ERK signalling*’*. Our work suggests that the allelic regulation of 
Nanog is involved in this process and that the increased expression of 
Nanog that accompanies the establishment of ground-state pluripotency 
is achieved through activation of its second allele. The dose regulation of 
Nanog through allelic switching could also contribute to promoting 
heterogeneity, which could provide a window of opportunity both for 
establishing ground-state pluripotency in the epiblast and for lineage 
segregation towards the primitive endoderm. We propose that the 
regulation of Nanog at the allelic level provides a novel mechanism 
to establish ground-state pluripotency in the naive epiblast. Thus, our 
work has uncovered an unexpected role for the allelic regulation of key 
transcription factors, as a novel mechanism in stabilizing pluripotency. 


METHODS SUMMARY 

The embryos were from F; (C57BL/6J X CBA/H) or C57BL/6) X M. musculus 
castaneus crosses. The blastocysts from Nanog*/* x C57BL/6] crosses”! were 
genotyped after immunostaining (47 Nanog'’* and 49 Nanog*’~ were analysed). 
The ES-cell lines and culture conditions are described in the Methods. Briefly, 
when referring to LIF, ES cells were cultured in 15% FCS and LIF; when referring 
to 2i/LIF, ES cells were cultured in 15% FCS, LIF, 3 uM CHIR99021 and 1 uM 
PD0325901. Embryos and ES cells were fixed with 4% paraformaldehyde in PBS for 
immunostaining. Images were acquired on a TCS SP5 confocal microscope (Leica). 
Image analysis was performed with the programs ImageJ, LAS AF Lite (Leica) 
or Imaris (Bitplane). For single-cell RT-PCR analysis, individual 8-cell-stage 
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blastomeres were mechanically dissociated, and the cells were collected in 3.1 pl 
Cell Lysis Buffer (Ambion) and processed for RT-PCR then digested with Avall. All 
primer sequences are listed in Supplementary Table 1. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Embryo collection and culture. Embryos were collected from ~6 week-old F, 
(C57BL/6] X CBA/H) superovulated females crossed with F, males. To obtain F, 
hybrid embryos, superovulated C57BL/6] females were crossed with M. musculus 
castaneus males. Embryos were collected at the following times after human 
chorionic gonadotrophin injection: the 2-cell stage (48 h), 4-cell stage (54 h), 8-cell 
stage (68h), 16-cell stage (80h), 32-cell stage (90h), early blastocyst stage (98 h) 
and late blastocyst stage (114h). To obtain late blastocysts, freshly collected early 
blastocysts were cultured in KSOM for 16h. To obtain E4.5, E4.75 and E5.0 stage 
embryos, embryos were collected from naturally mated females. The analysis of 
Nanog’'~ embryos was done using the Nanog“ line”. Blastocysts from 
Nanog*! ~ X C57BL/6J (+/+) crosses were processed for immunostaining and 
reconstructed in three dimensions using Imaris software. Individual ICM cells 
were counted after immunostaining with an anti-OCT4 antibody. The number of 
primitive endoderm and epiblast cells was determined by counting individual cells 
after immunostaining for GATA4 and NANOG. The total number of cells was 
determined by counting nuclei stained with 4',6-diamidino-2-phenylindole 
(DAPI). Individual blastocysts were genotyped after confocal acquisition as 
described previously’. Primer sequences for genotyping are listed in 
Supplementary Table 1. All experiments were performed in accordance with the 
current legislation in France and the approval of the Regional Ethics Committee 
(ComEth’s). 

ES-cell culture. Mouse ES-cell lines, E14, BC1 and TKO (the latter of which lacked 
all three DNA methyltransferases), were cultured in DMEM with GlutaMAX 
(Invitrogen) containing 15% FCS, LIF, 1mM sodium pyruvate, penicillin/ 
streptomycin and 0.1 mM 2-mercaptoethanol. The treatment of ES cells with 
inhibitors was performed using 3 4M CHIR99021 (a GSK3f inhibitor), 1 4M 
PD0325901 (a MEK inhibitor), 5 uM PD173074 (an FGF receptor tyrosine kinase 
activity inhibitor) and 3 UM PD184352 (a MEK inhibitor). The hybrid ES-cell line 
(BC1) was derived from E3.5 embryos from C57BL/6 X M. musculus castaneus 
crosses as described previously’’. 

Gene targeting. Mouse ES cells from the BD 10 line (C57BL/6) were electroporated 
with a first targeting vector consisting of 2A-3 TurboGFP-NLS-PEST-IRES-Neo. 
The resultant G418-resistant ES-cell (Nanog“'?’*) clone was subsequently sub- 
jected to a second targeting with a vector containing 2A-3mCherry-NLS-PEST- 
IRES-Hyg. Genomic DNAs from both G418- and hygromycin-B-resistant colonies 
were screened for homologous recombination into each Nanog allele by PCR and 
Southern blotting. 

Immunostaining. Embryos or ES cells were fixed with 4% paraformaldehyde in 
PBS for 20min at room temperature. After washing with PBS, embryos were 
permeabilized with 0.5% Triton X-100 in PBS for 10 min and then incubated in 
blocking solution (0.2% BSA in PBS) for 30 min. Primary antibodies were anti-OCT4 
(611202, BD Pharmingen), anti NANOG (MLC-51, eBioscience) and anti-GATA4 
(sc-1237, Santa Cruz Biotechnology). After incubation in blocking solution contain- 
ing primary antibodies for 1 h, cells or embryos were washed three times with 0.01% 
Triton X-100 in PBS for 5 min each and then incubated in blocking solution contain- 
ing secondary antibodies labelled with Alexa fluorophores (Invitrogen), Cy3, Cy5 
(Jackson ImmunoResearch) or Kodak X-SIGHT 640 (Carestream Health). After 
washing with PBS, mounting was done in VECTASHIELD (Vector Labs). Images 
were collected on a TCS SP5 confocal microscope (Leica). For all images, DAPI is 
shown in blue. Image analysis was performed using the software ImageJ, LAS AF Lite 
(Leica) and Imaris (Bitplane). 

Whole mount RNA-FISH. After removal of the zona pellucida by incubation in 
acidic Tyrode’s solution (Sigma), embryos were washed twice with PBS and then 
incubated in fixative containing 4% paraformaldehyde/1 PBS for 20 min at room 
temperature. Embryos were permeabilized with 0.5% Triton X-100 in fixative for 
10 min. After washing with PBS three times, embryos were pre-hybridized in a 2 il 
drop of hybridization buffer covered with mineral oil on a 35-mm glass bottom 
dish for 30 min at 50°C. The hybridization buffer consisted of 50% formamide, 
10% dextran sulphate, 2*SSC, 1 pg ul”! Cotl DNA, 1 ug ul”! yeast tRNA 
(Roche), 2mM vanadyl ribonucleoside complex (New England BioLabs), 1 mg 
ml | polyvinyl pyrrolidone (PVP), 1mM EDTA, 0.1% Triton X-100 and 1 mg 
ml | BSA. Embryos were then transferred to a 2 pil drop of hybridization buffer 
containing 5ng pl * fluorescent probes and incubated at 50°C overnight. After 
washing three times with 2XSSC, 0.1% Triton X-100 and 1 mg ml‘ PVP at 50°C 
for 10 min, embryos were then subjected to a gradient series (20%, 40%, 60%, 80% 
and 100%) of VECTASHIELD containing DAPI. Images were acquired on a TCS 
SP5 confocal microscope with a 63 glycerol immersion objective lens. For RNA- 
FISH of ES cells, ES cells grown on gelatin-coated coverslips were fixed and 
permeabilized as described above. After briefly washing with PBS, the coverslip 
was dehydrated in 70% ethanol and 100% ethanol for 10 min each and was sub- 
sequently air dried. Hybridization buffer containing the corresponding probes was 
applied to the coverslip. Subsequent hybridization, washing and mounting were 


performed as described above. The plasmids, BACs and fosmids used as probes 
for RNA-FISH were as follows: PBSII-Na1234 for Nanog, GOF6.1 for Oct4, 
G135P62831G1 and G135P630G2 for Gata4, RP23-476J7 or G135P6579H5 for 
Actb, G135P60852B4 for Sox2, G135P65370D8 for K/f2, G135P600983F2 for Fef4, 
G135P60839B3 for Bmp4, RP23-129L1 for Gata6, G135P65026G5 for Hsp70.1, 
G135P604053D8 for Pecam1, G135P69364G7 for Stella, G135P602899G3 for 
Rex1 and G135P60163E2 for Gapdh. The PBSII-Na1234 plasmid contains the 
NANOG-coding region without any repetitive sequences, as determined by 
RepeatMasker. BAC and fosmid probes were obtained from the BACPAC 
Resources Center. The specificity of all probes was confirmed by PCR using 
specific primers. Probes were labelled with handmade Alexa 488-dATP, 
TAMRA-dATP, ATTO 647N-dATP using a nick translation kit (Roche) and 
purified with a QIAquick PCR Purification Kit (QIAGEN). Images were analysed 
by the software Image] and LAS AF Lite. For all of the images, DAPI is shown in 
blue. It should be noted that the monoallelic expression of Nanog was independent 
of the genetic background, as it was reproducible in all of the mouse strains 
analysed (F, (C57BL/6]  CBA/H), Fy (C57BL/6J x M. musculus castaneus), 
CD1 and C57BL/6); data not shown). 

Statistical analysis of RNA-FISH. Only interphase cells were taken into account 
for all of the analyses. Images were analysed across three- dimensional planes by using 
the software Image] and LAS AF Lite. The RNA-FISH signals, which were detected 
over three sequential Z-sections were judged as positive signals to distinguish back- 
ground noise. Single RNA-FISH spots or two adjacent spots within 1 pum, which 
might result from DNA replication, were judged as monoallelism. Cells with two, 
three or four separate RNA-FISH spots were scored as biallelism. Statistical analysis 
was done using Fisher’s exact test, and the number of cells analysed in all experiments 
is indicated in the corresponding figure. 

DNA-FISH. DNA-FISH was performed as previously described**. Probes for 
DNA-FISH were as follows: RP23-117123 for Nanog, RP23-213M12 for Oct4, 
RP23-476J7 for Actb, RP23-106M4 for Cdx2, RP23-239M21 for Snurf and 
G135P602165C11 for H19. For all DNA-FISH images, DAPI is shown in blue. 
Allele-specific, single-cell RT-PCR. Individual 8-cell-stage blastomeres from 
hybrid embryos or single hybrid BC1 ES cells were used for this assay. Embryos 
or ES cells were treated with 0.25% trypsin and 1 mM EDTA at 37°C for 5 min 
with gentle pipetting. Single cells were manually collected by mouth pipette aided 
bya finely pulled glass tip, directly into 3.1 il Cell Lysis Buffer (Ambion) contain- 
ing RNase inhibitor (Invitrogen) and 73 nM specific primers for Nanog and Actb. 
The single-cell samples were snap frozen in liquid nitrogen and stored at —80°C 
until use. Lysates were incubated at 65 °C for 5 min and placed on ice for 5 min. 
Reaction mixture (7 ul) (2 pl 5XRT buffer, 2 ul 2.5mM dNTPs, 0.1 pl RNase 
inhibitor, 0.2 ul Transcriptor (Roche) and 2.7 11 water) was added to each tube. 
Reverse transcription was performed at 25 °C for 10 min, 37 °C for 15 min, 55 °C 
for 30 min and 85°C for 5 min. The reaction mixtures were subjected to a first 
round of multiplex PCR with primer mixtures for Nanog and Actb. This first PCR 
reaction mixture (0.5 ul) was subsequently used for a second round of PCR with a 
specific primer set for Nanog or Actb. Pwo SuperYield DNA Polymerase (Roche) 
was used for PCR reactions. After the second PCR round, PCR products for Nanog 
were digested with the AvalI restriction enzyme and analysed by acrylamide gel 
electrophoresis. Primer sets for Nanog- and Cdkn1-flanking single nucleotide 
polymorphism (SNP) sites are listed in Supplementary Table 1. SNPs in the 
NANOG-coding region were identified by using The Jackson Laboratory’s 
Mouse SNP database (http://phenome.jax.org/SNP/). SNPs in Cdkn1 are described 
elsewhere”. Primer sequences are listed in Supplementary Table 1. 

Quantitative (real-time) PCR. RNA was extracted from ES cells with an RNeasy 
Mini Kit (QIAGEN) and reverse transcribed with Transcriptor (Roche). Real-time 
PCR was performed with SYBR Green JumpStart Taq ReadyMix (Sigma) on a 
LightCycler 480 Real-Time PCR System (Roche). The relative expression level of 
each gene was normalized to the Gapdh expression level. Primer sequences are 
listed in Supplementary Table 1. 

ChIP assay. ChIP assays were performed with ES cells as previously described”*. 
Primer sequences are listed in Supplementary Table 1. The antibodies used were 
anti-histone H3 (ab1791, Abcam), anti-H3K4me3 (pAV-MEHAHS-024, 
Diagenode) anti-MED12 (A300-774A, Bethyl Laboratories), anti-NIPBL (301- 
779A, Bethyl Laboratories), anti-RNA polymerase II (PolII) (N-20, Santa Cruz 
Biotechnology), anti-PolII CTDSer2P (ab5095, Abcam) and anti-PollI CTDSer5P 
(ab5131, Abcam). The enrichment of histone modifications was quantified by 
real-time PCR as described above and normalized to histone H3. For MED12, 
NIPBL, PollI, PolII CTDSer2P and PolIICTDSer5P, the percentage input was 
calculated for each ChIP fraction. 

Time-lapse imaging. NGR ES cells were cultured in Knockout DMEM (Invitrogen) 
supplemented with 15% Knockout Serum Replacement (Invitrogen), LIF, non- 
essential amino acids, glutamine, gentamicin and 2-mercaptoethanol, under condi- 
tions of 5% CO, and 95% air. The cells were grown on Culture-Insert (Ibidi) placed 
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on laminin-511 (Biolamina)-coated 3-cm glass-bottomed dishes (Mattek). The 
dishes were placed in an incubation chamber (Tokai Hit) on the microscope stage 
at 37 °C. An inverted microscope (Leica) attached to a Nipkow disk confocal micro- 
scope (Yokogawa Electric) and EMCCD camera (iXon, Andor Technology) was 
controlled with iQ software (Andor Technology). TurboGFP and mCherry were 
excited with 488-nm and 560-nm lasers, respectively. Images for each colour (green 
or red) were acquired across 17.5 um (7 Z-planes) every 20 min for ~50h. 

Analysis of live-cell imaging data. Time-lapse data were analysed using the 
Image] software plug-in Circadian Gene Expression (CGE)”. Fluorescent images 
were merged and used for the segmentation of nuclei. Cells (n = 200) were randomly 
selected, and the fluorescence intensities for TurboGFP and mCherry in each 
nucleus were measured in each time frame. Background intensity was subtracted 
from the mean fluorescence intensity and plotted as the expression level against 
time during a single cell cycle (~10 h) as shown in Supplementary Fig. 6. Cells that 
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were positive for TurboGFP, positive for mCherry, positive for both or negative for 
both were scored as monoallelic (green), monoallelic (red), biallelic (yellow) or 
non-expressed (black), respectively. Based on the kinetics of TurboGFP and 
mCherry during a single cell-cycle, the proportion of cells showing a transition 
between these four groups was calculated. 
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IDH mutation impairs histone demethylation and 
results in a block to cell differentiation 


Chao Lu’, Patrick S. Ward”, Gurpreet S. Kapoor?, Dan Rohle*®, Sevin Turcan*, Omar Abdel-Wahab*®, Christopher R. Edwards’, 
Raya Khanin®, Maria E. Figueroa’, Ari Melnick®, Kathryn E. Wellen?, Donald M. O’Rourke?°, Shelley L. Berger’, 
Timothy A. Chan‘, Ross L. Levine*’, Ingo K. Mellinghoff**"' & Craig B. Thompson! 


Recurrent mutations in isocitrate dehydrogenase 1 (IDH1) and 
IDH2 have been identified in gliomas, acute myeloid leukaemias 
(AML) and chondrosarcomas, and share a novel enzymatic property 
of producing 2-hydroxyglutarate (2HG) from a-ketoglutarate’*. 
Here we report that 2HG-producing IDH mutants can prevent 
the histone demethylation that is required for lineage-specific 
progenitor cells to differentiate into terminally differentiated cells. 
In tumour samples from glioma patients, IDH mutations were 
associated with a distinct gene expression profile enriched for genes 
expressed in neural progenitor cells, and this was associated with 
increased histone methylation. To test whether the ability of IDH 
mutants to promote histone methylation contributes to a block in 
cell differentiation in non-transformed cells, we tested the effect of 
neomorphic IDH mutants on adipocyte differentiation in vitro. 
Introduction of either mutant IDH or cell-permeable 2HG was 
associated with repression of the inducible expression of lineage- 
specific differentiation genes and a block to differentiation. This 
correlated with a significant increase in repressive histone methyla- 
tion marks without observable changes in promoter DNA methyla- 
tion. Gliomas were found to have elevated levels of similar histone 
repressive marks. Stable transfection of a 2HG-producing mutant 
IDH into immortalized astrocytes resulted in progressive accu- 
mulation of histone methylation. Of the marks examined, increased 
H3K9 methylation reproducibly preceded a rise in DNA methyla- 
tion as cells were passaged in culture. Furthermore, we found that 
the 2HG-inhibitable H3K9 demethylase KDM4C was induced 
during adipocyte differentiation, and that RNA-interference suppres- 
sion of KDM4C was sufficient to block differentiation. Together 
these data demonstrate that 2HG can inhibit histone demethylation 
and that inhibition of histone demethylation can be sufficient to 
block the differentiation of non-transformed cells. 

The fact that IDH mutations were identified in multiple cancers 
with disparate tissues of origin suggests that 2HG-producing mutant 
enzymes probably affect some fundamental cellular processes that 
facilitate tumour progression. To study the effects of IDH mutations, 
we collected and performed gene expression microarray analysis on 
tumour specimens from patients with grade II-III oligodendroglioma. 
Sequencing results revealed a high frequency of IDH mutations in 
oligodendroglioma (33 of the samples had the R132 IDH1 mutation, 
2 had the R172 IDH2 mutation and 6 were wild type for IDH1/2). 
Supervised analysis found a statistically enriched gene signature in 
IDH-mutant samples (q value <10%, fold change >2; Fig. 1a and Sup- 
plementary Table 1) that was independent of tumour grade and recur- 
rence status and survived multiple testing corrections. Gene-ontology 
analysis identified the regulation of astrocyte and glial differentiation 


as the top two functional categories enriched in differentially expressed 
genes (Supplementary Table 2). We previously reported that IDH 
mutation may promote leukaemogenesis by expanding the haemato- 
poietic progenitor cell population and impairing haematopoietic dif- 
ferentiation’, and that such a phenotype could be attributed at least in 
part to mutant IDH-induced inhibition of TET2, an o-ketoglutarate 
(aKG)-dependent enzyme potentially involved in DNA demethyla- 
tion’®. Although DNA hypermethylation has been associated with 
IDH mutation in glioma samples’, no mutations in TET family mem- 
bers have been found in this disease. We explored the possibility that 
IDH mutation may affect additional «KG-dependent enzymes that 
contribute to the regulation of cell differentiation. 

Histone lysine methylation is an integral part of the post- 
translational modifications of histone tails that are important for 
chromatin organization and regulation of gene transcription'®’. In 
vitro 2HG can competitively inhibit a family of «KG-dependent 
Jumonji-C domain histone demethylases (JHDMs)'*"*. To determine 
whether IDH-associated changes in histone methylation could be 
observed in cells, we ectopically expressed wild-type or mutant 
IDH1 or IDH2 in 293T cells and found that mutant IDH1 or IDH2 
led to a marked increase in histone methylation compared to the wild- 
type enzymes. Transient transfection of wild-type IDH2 can also lead 
to increased 2HG production’. In all of the samples, the magnitude of 
increase in methylation correlated with the intracellular 2HG levels 
produced by IDH transfection (Fig. 1b and Supplementary Fig. 1). To 
test whether histone lysine methylation was dysregulated in gliomas 
with IDH mutation, immunohistochemistry analysis of patient oligo- 
dendroglioma samples was performed for several well-characterized 
histone marks. Compared to tumours with wild-type IDH, there was a 
statistically significant increase in the repressive trimethylation of 
H3K9 (H3K9me3) and an increasing trend in trimethylation of 
H3K27 (H3K27me3) in tumours with IDH1 mutation (Fig. 1c). No 
statistically significant difference was seen in trimethylation of H3K4 
(H3K4me3), a mark associated with active transcription (data not 
shown). These data suggested that IDH mutations might preferentially 
affect the regulation of repressive histone methylation marks in vivo. 

As IDH mutations were associated with glial tumours of the ‘pro- 
neural’ phenotype’’, we sought to determine whether the persistence of 
histone repressive marks promoted by mutant IDH was sufficient to 
block the differentiation of non-transformed cells. Upon stimulation 
with a differentiation cocktail, immortalized murine 3T3-L1 cells 
undergo extensive chromatin remodelling, resulting in their matura- 
tion into adipocytes'’. 3T3-L1 cells transduced with R172K mutant 
IDH2 produced 2HG whereas cells transduced with either wild-type 
IDH2 or vector alone did not (Fig. 2a). All three cell types were then 
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Figure 1 | IDH mutations are associated with dysregulation of glial 
differentiation and global histone methylation. a, Heatmap representation of 
a two-dimensional hierarchical clustering of genes identified as differentially 
expressed between IDH mutant (mut) patient oligodendroglioma samples and 
IDH wild-type (WT) samples. Each row represents a gene and each column 
represents a specimen. IDH mutational status, tumour grade and recurrence of 
each sample are listed. b, 293T cells transfected with empty vector (Vec), wild- 
type or R132H mutant IDH1, or wild-type or R172K mutant IDH2 for 3 days 
were lysed and assessed for expression levels of IDH1, IDH2 and histone lysine 


induced to differentiate into adipocytes. After 7 days of differentiation 
induction, IDH mutant cells had visibly reduced lipid droplet accu- 
mulation compared to vector and IDH wild-type cells, as shown by 
Oil-Red-O staining (Fig. 2b). In separate experiments, stable transfec- 
tion of R140Q mutant IDH2 also resulted in a block to adipocyte 
differentiation (data not shown). To determine whether 2HG was 
sufficient to mediate the effect of mutant IDH on cell differentiation, 
we synthesized cell-permeable 1-octyl-p-2-hydroxyglutarate (octyl- 
2HG; Supplementary Fig. 2). Treatment of 3T3-L1 cells with octyl- 
2HG led to a dose-dependent inhibition of lipid accumulation (Fig. 2c). 
Gene expression analysis showed that despite exposure to a well- 
standardized differentiation protocol’*, IDH mutant cells or cells 
treated with octyl-2HG exhibited a profound defect in the expression 
of transcription factors essential for executing adipogenesis (Cebpa and 
Pparg) and an adipocytic lineage-specific gene (Adipoq) (Fig. 2d, e), 
suggesting that these cells failed to execute adipocyte differentiation. 

Cells were harvested for a chromatin immunoprecipitation (ChIP) 
assay using antibodies against H3K9me3 and H3K27me3, before or 
after 4 days of differentiation induction. Quantitative polymerase 
chain reaction (PCR) with primers targeting promoters of Cebpa 
and Adipog revealed that at day 4 there was a statistically significant 
increase in H3K9me3 and H3K27me3 at promoters of both genes in 
IDH mutant cells (Fig. 2f). These repressive marks also showed a 
modest but significant increase at gene promoters before differenti- 
ation induction. In contrast, quantitative assessment of DNA methyla- 
tion at promoters of Cebpa and Adipog by MassARRAY failed to reveal 
any significant difference between IDH wild-type and mutant cells 
(Supplementary Fig. 3). In addition to gene-specific changes, we 
detected a global increase in H3K9 methylation and a reciprocal 
decrease in H3 acetylation (Fig. 2g and Supplementary Fig. 4). 

To determine whether IDH mutation was sufficient to induce 
enhanced repressive histone methylation in central nervous system 


methylation by western blotting with specific antibodies. Total H3 was used as 
loading control. Quantification of band intensities is shown in Supplementary 
Fig. 1. Bottom panel provides quantification of intracellular 2HG to glutamate 
ratio (2HG/glutamate) as previously reported for these transfectants’. 

c, Immunohistochemistry staining with antibodies against H3K9me3 and 
H3K27me3 in IDH1/2 wild-type and IDH1 mutant oligodendroglioma samples 
(X40 magnification). Image quantification was performed using Metamorph 
software (see Methods) and shown in bottom panels. Error bars represent 
standard deviation (s.d.) of at least three patient samples in each group. 


(CNS)-derived cells and whether it was associated with altered neural 
gene expression, we retrovirally transduced immortalized normal 
human astrocytes (NHAs) with either wild-type or R132H mutant 
IDH1. Compared to parental cells, late-passage cells expressing 
mutant IDH exhibited elevated levels of a variety of histone methyla- 
tion marks (Fig. 3a), and this correlated with an enhanced expression 
of the neural marker nestin (Fig. 3b). IDH mutations have been asso- 
ciated with CpG-island hypermethylation’ and consistent with this we 
observed that total CpG methylation was increased in IDH mutant 
cells (Supplementary Fig. 5). Because histone repressive marks can 
promote DNA methylation and vice versa'’, we studied the temporal 
relationship of histone and DNA methylation in IDH-expressing 
astrocytes (Fig. 3c-e and Supplementary Fig. 5). The first observable 
change of the histone marks we examined was H3K9me3. H3K9me3 
levels were significantly elevated by passage 12 after cells were infected 
with mutant IDH. Changes in other histone methylation marks were 
either delayed and of lower magnitude (H3K27me3 and H3K79me2) 
or were not observed (H3K4me3). Increases in DNA methylation were 
never observed before passage 17 and the difference in DNA methyla- 
tion reached statistical significance only at passage 22. 

To test whether the IDH1 R132H mutation could interfere with 
neural differentiation in the absence of prolonged adaptation in culture, 
primary neurosphere cultures established from the brains of p16/ 
p19’~ mice were infected with a retroviral construct containing 
IDH1 R132H mutant, wild-type IDH1 or the vector alone (Supplemen- 
tary Fig. 6). After infection the cells were re-plated under conditions that 
promote astrocyte differentiation and induced to differentiate further 
by treatment with retinoic acid without further passaging. IDH mutant 
cells failed to induce expression of the astrocytic marker GFAP and 
exhibited expression of the neural marker B3-tubulin (Fig. 3f). When 
the differentiation conditions were supplemented with retinoic acid, 
enhanced expression of the astrocytic marker GFAP was observed in 
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Figure 2 | Differentiation arrest induced by mutant IDH or 2HG is 
associated with increased global and promoter-specific H3K9 and H3K27 
methylation. a, 3T3-L1 cells stably expressing empty vector, wild-type, or 
R172K mutant IDH2 were lysed and assessed for expression levels of IDH2 or 
IDHI by western blotting. Cells were also extracted for intracellular 
metabolites, which were then MTBSTFA-derivatized (see Methods) and 
analysed by gas chromatography—mass spectrometry (GC-MS). The 
quantification of 2HG signal intensity relative to the intrasample glutamate 
signal is shown for a representative experiment. b, Cells were induced to 
differentiate into mature adipocytes for 7 days. The accumulation of lipid 
droplets was assessed by Oil-Red-O staining. Wells from a representative 
experiment from a total of four independent experiments are shown. c, 3T3-L1 
cells were induced to differentiate for 7 days in the absence or presence of 1 mM 
or 2mM octyl-2HG. Oil-Red-O staining was performed and quantified by 
measuring absorbance at 500 nm. DMSO, dimethylsulphoxide. d, Vector, wild- 
type or R172K mutant IDH2 transduced 3T3-L1 cells were induced to 
differentiate for 4 days. At days 0 and 4 (d0 and d4), RNA was extracted. Relative 


IDH wild-type and vector cells but GFAP expression remained 
repressed in IDH mutant cells. 

The enhancement of H3K9 methylation in mutant IDH-expressing 
cells from multiple tissues of origin led us to investigate whether this 
H3K9 methylation might be sufficient to block the ability of non- 
transformed cells to execute differentiation. Support for this hypothesis 
came with the discovery that KDM4C (also known as JMJD2C), 
an H3K9-specific JHDM, was induced in 3T3-L1 cells during differ- 
entiation (Fig. 4a). An in vitro histone demethylase assay with recom- 
binant human GST-tagged KDMA4C confirmed that KDM4C effec- 
tively removed H3K9me2 and H3K9me3 in the presence of «KG. 
Importantly, the demethylation reaction was inhibited by 2HG in a 
dose-dependent manner (Fig. 4b and Supplementary Fig. 7). Given 
the similarities between 2HG and aKG, the inhibition of KDM4C by 
2HG would be predicted to be competitive. Consistently, increasing the 
concentration of «KG in the reaction mixture reversed the inhibition of 
H3K9 demethylation by 2HG (Fig. 4c). 

Finally, to test the possibility that H3K9 demethylation is a required 
component of adipocyte differentiation, we examined whether blocking 
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expression of adipocyte-specific gene and transcription factors was assessed by 
quantitative PCR with reverse transcription (RT-—qPCR). e, 3T3-L1 cells were 
induced to differentiate for 4 days in the absence or presence of 1 mM or 2mM 
octyl-2HG. RNA was extracted. Relative expression of adipocyte-specific gene 
and transcription factors was assessed by RT-qPCR. f, Vector, wild-type or 
R172K mutant IDH2 transduced 3T3-L1 cells were induced to differentiate. At 
days 0 and 4 (d0 and d4), ChIP analysis was performed using antibodies against 
H3K9me3 and H3K27me3. Immunoprecipitated Cebpa and Adipog promoter 
sequences were analysed by qPCR and shown as percentage of input. g, Vector, 
wild-type or R172K mutant IDH2 transduced 3T3-L1 cells were induced to 
differentiate for 4 days. At days 0 and 4 (dO and d4), histones were acid- 
extracted and levels of H3K9me3, H3K9me2 and acetyl-H3 were assessed by 
western blotting with specific antibodies. Total H3 was used as loading control. 
Quantification of band intensities is shown in Supplementary Fig. 4. In f, error 
bars indicate s.d. from triplicate wells and a representative experiment from a 
total of two is shown. For all other experiments, error bars indicate s.d. from 
three independent experiments. *P < 0.05; **P < 0.01; NS, not significant. 


the induction of KDMA4C was sufficient to impair the differentiation of 
3T3-L1 cells. Treatment with three independent short interfering 
RNAs (siRNAs) against KDM4C reduced its expression and enhanced 
H3K9me3 in 3T3-L1 cells (Fig. 4d and Supplementary Fig. 8). After 
differentiation induction, cells treated with KDM4C siRNAs exhibited 
reduced ability to differentiate into adipocytes. Thus the inability to 
erase repressive H3K9 methylation can be sufficient to impair the 
differentiation of non-transformed cells. 

Biochemical studies suggest that 2HG is a universal inhibitor of 
JHDM family members’*"’; therefore it was interesting to observe 
that H3K9 demethylation seemed to be more sensitive to mutant 
IDH-induced suppression than at least some other histone methyla- 
tion marks. Future investigation of the sensitivity to 2HG inhibition 
among JHDM family members and/or cellular feedback mechanisms 
activated after defective histone demethylation will be needed. In addi- 
tion to the data presented here, evidence is mounting for a direct role 
of histone methylation in stem cell maintenance, differentiation and 
tumorigenesis'’”*? (see Supplementary Discussion). Our findings 
support a role for xKG-dependent demethylases in cell differentiation 
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Figure 3 | IDH mutation induces histone methylation increase in CNS- 
derived cells and can alter cell lineage gene expression. a, Immortalized NHA 
cells were retrovirally transduced with constructs containing wild-type or 
R132H mutant IDH1. Histones were acid-extracted from parental cells or cells 
expressing wild-type or mutant IDH1 at late (>40) passages. Histone lysine 
methylation levels were assessed by western blotting with specific antibodies. 
Total H3 was used as loading control. Images presented are panels from 
different areas of the same gel. b, Parental, IDH1 wild-type and R132H mutant 
NHA cells at late passages were lysed and assessed for expression levels of nestin 
by western blotting. p85 was used as loading control. Images presented are 
panels from different areas of the same gel. c, NHA cells were retrovirally 
transduced with constructs containing wild-type or R132H mutant IDH1. 
Histones were acid-extracted at different time points as cells were passaged in 
culture. Histone lysine methylation levels were assessed by western blotting 
with specific antibodies. Total H3 was used as loading control. Images 
presented are panels from different areas of the same gel. d, Western blot band 
intensities of H3K9me3 in c and two additional independent experiments were 
quantified using Image J. Red squares indicate IDH1 wild-type cells. Green 
triangles indicate IDH1 R132H mutant cells. e, Total CpG methylation of IDH1 
wild-type and R132H mutant NHA cells at various passages was measured by 
FACS using 5-methylcytosine-specific antibody and shown as normalized 
mean fluorescence intensity. FACS histograms from a representative 
experiment are shown in Supplementary Fig. 5. f, Neurosphere cultures 
established from the subventricular zone of brains of p16/p19 ’~ mice were 
infected with a retroviral construct containing IDH1 R132H mutant, wild-type 
IDH1 or the vector alone and induced to differentiate. GFAP and $3-tubulin 
expression levels were assessed by western blotting. p85 was used as loading 
control. In d and e, error bars indicate standard error of the mean from three 
independent experiments. *P < 0.05; **P < 0.01. 
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Figure 4 | 2HG-inhibitable H3K9 demethylase KDMAC is required for cell 
differentiation. a, 3T3-L1 cells were induced to differentiate (Diff) for 3 days. 
Before (d0) and each day after differentiation induction, cells were lysed and 
KDM4C protein levels were assessed by western blotting with specific antibody. 
Tubulin was used as loading control. b, Bulk histones were incubated with 
purified GST-tagged human KDM4C in the reaction mix with 1 mM «KG and 
increasing concentrations of D-2HG. Levels of GST tag, H3K9me3 and 
H3K9me2 were assessed by western blotting with specific antibodies. Total H3 
was used as loading control. c, Bulk histones were incubated with purified GST- 
tagged human KDM4C in the reaction mix. 10 mM D-2HG was added to inhibit 
the demethylation reaction in the presence of increasing concentrations of 
aKG., Levels of H3K9me3 were assessed by western blotting with specific 
antibody. Total H3 was used as loading control. d, 3T3-L1 cells were transfected 
with control siRNA or siRNA specific for KDMAC. After 3 days, cells were lysed 
and assessed for expression levels of KD M4C and H3K9me3 by western blotting 
with specific antibodies. Total H3 was used as loading control. Cells from the 
same treatment were induced to differentiate for 7 days. The accumulation of 
lipid droplets was assessed by Oil-Red-O staining. Wells from a representative 
experiment from a total of three independent experiments are shown. 
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that can be impaired through the cellular accumulation of 2HG pro- 
duced by IDH mutation. 


METHODS SUMMARY 


Details about histone extraction, GC-MS and ChIP assay can be found in 
Methods. In brief, 3T3-L1 cell differentiation, Oil-Red-O staining, and in vitro 
histone demethylase assay were performed as previously described**”’. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Patient samples, microarray, gene-ontology analysis. Primary oligodendroglioma 
samples were obtained with approval from the institutional review board at the 
University of Pennsylvania and were de-identified for the study. For microarray 
analysis, tumour sample RNA was extracted with Trizol and purified with Qiagen 
RNeasy, and then assayed on an Affymetrix Human Gene 1.0ST array. Significance 
Analysis of Microarrays (http://www-stat.stanford.edu/~tibs/SAM/sam.pdf) was 
applied to find differentially expressed genes (q value <10% and fold change >2). 
Functional analysis of differentially expressed genes was done using the DAVID tool 
(http://david.abcc.ncifcrf.gov/home.jsp) using all human genes as a background set. 
3T3-L1 cell differentiation, Oil-Red-O staining. 3T3-L1 cell differentiation and 
Oil-Red-O staining were carried out as described previously. In brief, confluent 3T3- 
LI cells were stimulated with a cocktail containing 0.5 mM isobutylmethylxanthine, 
1 uM dexamethasone, 5 jig/ml insulin and 5 11M troglitazone (all from Sigma) to 
induce differentiation. Cells were maintained in medium with insulin after 2 days 
of differentiation until ready to be harvested. For Oil-Red-O staining, cells were 
washed in PBS and then fixed for 20 min at room temperature (25 °C) with 3% 
paraformaldehyde. Cells were then washed with de-ionized water and stained 
with Oil-Red-O solution. For quantification, Oil-Red-O staining was dissolved 
in isopropanol and absorbance was measured at 500 nm. 

In vitro histone demethylase assay. The histone demethylase assay was carried 
out as described previously”. In brief, 4 jig bulk calf thymus histones (Sigma) were 
incubated with GST-tagged KDMA4C (1.42 jig; BPS Bioscience) in a reaction mix 
containing 50mM Tris-HCl pH 8.0, protease inhibitors cocktail, 1mM «KG, 
100 [tM FeSO, and 2 mM ascorbic acid at 37 °C for 4h, in the absence or presence 
of various concentrations of D-2HG or L-2HG (Sigma). Reaction mixtures were 
analysed by western blotting using specific antibodies. 

Cell culture, transfection and transduction, generation of cell lines. 293T cells, 
NHA cells immortalized by E6/E7/hTERT (provided by R. Pieper*®) and 3T3-L1 
cells were cultured in Dulbecco’s modified Eagle’s medium (DMEM; Invitrogen) 
with 10% fetal bovine serum (FBS; CellGro). For expression of wild-type and 
mutant IDH1/2 in 293T cells, transfection was performed with Lipofectamine 
2000 (Invitrogen) according to the manufacturer’s instructions. For generation 
of IDH2 retrovirus and transduction of 3T3-L1 cells, supernatant from 293T cells 
transfected with pCL-Eco helper virus and plasmids was collected after 72h, 
filtered and applied to cells overnight. For generating 3T3-L1 cell lines with 
stable expression of wild-type or mutant IDH2, cells were grown in 2.5,gml | 
puromycin for 7 days after retroviral transduction. Pooled populations of 
puromycin-resistant cells were obtained, and then continuously cultured in 
puromycin. For generation of IDH1 retrovirus and transduction of NHA cells, 
GP2-293 cells (Clontech, 631458) were calcium phosphate transfected with equal 
amounts of pVSV-G (Clontech, 631512) and plasmids. Virus was harvested at day 
2 and day 3 after transfection and placed on logarithmically growing cells. After 
infection, cells were placed in 800 1g ml’ G418 (Invitrogen) to generate stable 
cell lines. For siRNA knockdown of KDMAC, transfections were performed with 
Lipofectamine RNAiMAX (Invitrogen), using siRNAs targeting KDM4C 
(#1: sense, 5'-GCUUGAAUCUCCCAAGAUATT-3’; antisense, 5’-UAUCUU 
GGGAGAUUCAAGCTT-3’; #2: sense, 5‘-CAAAGUAUCUUGGAUCAAATT-3’; 
antisense, 5’- UUUGAUCCAAGAUACUUUGCC-3’; #3: sense, 5’-GAGGAGUU 
UCGGGAGUUCAACAAAU-3’; antisense, 5’-AUUUGUUGAACUCCCGAA 
ACUCCUC-3’) or a non-targeting control (Dharmacon, #D-001810-01-20) at a 
concentration of 40 nM. 

Mutational analysis. For IDH mutation analysis, tumour genomic DNA was 
extracted and the regions surrounding IDH1 codon 132 and IDH2 codons 140 
and 172 were amplified by PCR followed by sequencing. IDH1 analysis used forward 
primer 5'-ACCAAATGGCACCATACGA-3’ and reverse primer 5'-TTCATACC 
TTGCTTAATGGGTGT-3’ for amplification, and primer 5’-CGGTCTTCAGAG 
AAGCCATT-3’ for sequencing’. IDH2 analysis used forward primer 5’-CAG 
AGACAAGAGGATGGCTAGG-3’ and reverse primer 5'-GTCTGCCTGTG 
TTGTTGCTTG-3’ for amplification, and the same forward primer for sequen- 
cing’. Out of the 42 tumours analysed, 41 had sufficient high quality genomic 
DNA for discerning IDH mutation status. The one sample unable to be classified as 
either IDH wild type or mutant was excluded from further analysis. 

Plasmid construction. The cDNA clone of human IDH1 (BC012846.1) was 
purchased from the American Type Culture Collection in pCMV-Sport6, and 
human IDH2 (BC009244) was purchased from Invitrogen in pOTB7. Standard 
site-directed mutagenesis techniques were used to generate IDH1 R132H by 
introducing a G395A base-pair change in the IDH1 open reading frame (ORF). 
IDH2 R172K was made by introducing a G515A change in the IDH2 ORF, while 
IDH2 R140Q was made with a G419A alteration. Wild-type and mutant sequences 
were then subcloned into LPC vector. All sequences were confirmed by direct 
sequencing before expression in 293T cells and retrovirus generation. Retroviral 
constructs used for neurosphere infection were generated by excising wild-type 
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IDH1 and IDH1 R132H with NotI and Pacl restriction enzymes from the previ- 
ously made vectors and incorporating into the pQCXIH (Clontech, 631516) 
retroviral vector. 

Histone extraction and western blotting. For histone acid extraction, cells were 
lysed in hypotonic lysis buffer (10 mM HEPES, 10mM KCl, 1.5mM MgCh, 
0.5mM DTT, protease inhibitors) for 1h. H,SO4 was added to 0.2 N overnight 
at 4°C with rotation. After spinning down and collecting supernatant, proteins 
were precipitated in 33% TCA, washed with acetone, and resuspended in 
de-ionized water. For whole-cell lysates, cells were lysed and sonicated in standard 
RIPA buffer (1% sodium deoxycholate, 0.1% SDS, 1% Triton X-100, 0.01 M Tris 
pH 8.0 and 0.14 M NaCl), and lysates were then centrifuged at 14,000g at 4 °C for 
10 min. Supernatants were collected and measured for total protein concentration. 
For western blotting, lysates were separated by SDS-PAGE, transferred to 
nitrocellulose membrane, blocked in 5% non-fat milk in PBS containing 0.5% 
Tween-20, probed with primary antibodies and detected with horseradish- 
peroxidase-conjugated anti-rabbit or anti-mouse antibodies (GE Healthcare, 
NA934V and NA931V). Primary antibodies used were: anti-IDH1 (Proteintech, 
12332-1-AP), anti-IDH2 (Abcam, ab55271), anti-GST tag (Millipore, 05-311), 
anti-H3K9me2 (Cell Signaling Tech, 9753), anti-H3K9me3 (Abcam, ab8898), 
anti-H3K36me3 (Abcam, ab9050), anti-H3K27me3 (Millipore, 17-622), anti- 
H3K4me3 (Millipore, 17-614), anti-H3K79me2 (Cell Signaling Tech, 9757), 
anti-KDMA4C (Abcam, ab85454), anti-acetyl H3 (Upstate, 06-599), anti-H3 (Cell 
Signaling Tech, 4499), anti-tubulin (Sigma, T9026), anti-GFAP (Cell Signaling Tech, 
3670), anti-B3-tubulin (Cell Signaling Tech, 5666), anti-p85 (Millipore, 06-195), 
anti-nestin (Millipore, MAB5326), anti-B-actin (Sigma, A5316). Anti-IDH1 
R132H mutant antibody was a gift from Agios Pharmaceuticals. Quantification 
of western blot band intensity was performed using Image J software according to 
the manufacturer’s instructions. 

Metabolite extraction, GC-MS. After gentle removal of culture medium, cells 
were rapidly quenched with ice-cold 80% methanol and incubated at —80°C for 
20 min. After sonication, extracts were then centrifuged at 14,000g for 20 min at 
4° C to remove precipitated protein and the aqueous metabolites in the supernatant 
layer were dried under nitrogen gas. For 293T cells, organic acids were further 
purified by redissolving the dried extract in de-ionized water, followed by elution 
from an AG-1 X8 100-200 anion exchange resin (Bio-Rad) in 3 N HCl after wash- 
ing with five column volumes. 

For GC-MS analysis, dried extracts were redissolved in a 1:1 mixture of 
acetonitrile and N-methyl-N-tert-butyldimethylsilyltrifluoroacetamide (MTBSTFA; 
Regis) and heated for 75 min at 70 °C to derivatize metabolites. Samples were then 
injected into an Agilent 7890A GC with an HP-5MS capillary column, connected to 
an Agilent 5975 C mass selective detector operating in splitless mode using electron 
impact ionization with ionizing voltage of —70eV and electron multiplier set to 
1,060 V. GC temperature was started at 100 °C for 3 min, ramped up to 230°C at 
4°Cmin ' and held for 4 min, then ramped up to 300 °C and held for 5 min. Mass 
range of 50-500 AMU was recorded at 2.71 scans per second. Identification of the 
2HG metabolite peak was confirmed using standards obtained from Sigma. The 
2HG and glutamate signal intensities were quantified by integration of peak areas. 
Quantitative real-time PCR. RNA was isolated using Trizol (Invitrogen). After 
incubating with DNase, cDNA was synthesized using Superscript II reverse tran- 
scriptase (Invitrogen). Quantitative PCR was performed on a 7900HT Sequence 
Detection System (Applied Biosystems) using Taqman Gene Expression Assays 
(Applied Biosystems). Gene expression data was normalized to 18S rRNA. 
ChIP. ChIP was performed with the Millipore Magna ChIP G kit (Millipore, 17- 
611). In brief, 2,000,000 cells were cross-linked with 1% formaldehyde for 10 min 
at room temperature. After washing with cold PBS, cells were centrifuged and 
lysed in 500 pl SDS lysis buffer for 10 min on ice. Lysate was then sonicated using 
Bioruptor sonicator (Diagenode) to shear DNA to approximately 200-600 bp. 
Samples were spun down and 50 ul of the supernatant was used for each immuno- 
precipitation overnight with magnetic beads after 10X dilution. Primary antibodies 
(3 ug per ChIP) used were: anti- H3K9me3 (Abcam, ab8898) and anti-H3K27me3 
(Millipore, 17-622). Normal rabbit IgG (Millipore, 12-370) was used as control and 
showed minimal enrichment. The next day, samples were washed in low-salt 
immune complex buffer, high-salt immune complex buffer, LiCl immune complex 
buffer and TE buffer. Histone complexes were eluted in elution buffer plus 
proteinase K for 2h at 65°C. DNA was recovered using columns. Quantitative 
PCR was performed on purified DNA samples. Primers used are: Adipog forward, 
5'-ATGGCTGAACCACACAGCTTCA-3’; reverse, 5'-AGGGGTCAGGAGA 
CCTCCCTTT-3'; Cebpa forward, 5'-CTGGAAGTGGGTGACTTAGAGG-3’; 
reverse, 5'-GAGTGGGGAGCATAGTGCTAG-3’. Data points (Ct) are converted 
to percentage of input. 

Quantitative DNA methylation analysis. Matrix-assisted laser desorption/ 
ionization time-of-flight mass spectrometry using EpiTlyper by MassARRAY 
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(Sequenom) was performed on bisulphite-converted DNA extracted from 3T3-L1 
cells. MassARRAY primer design was done as previously described**”’. 
Immunohistochemistry. Immunohistochemistry detection was performed using 
Discovery XT processor (Ventana Medical Systems). The tissue sections were 
blocked for 30 min in 10% normal goat serum in 0.2% BSA/PBS, followed by 
incubation for 5h with 0.1pgml~! of the rabbit polyclonal anti-H3K9me3 
(Abcam, ab8898) or 1 pg ml! rabbit polyclonal anti-H3K27me3 (Millipore, 
07-449) antibodies and incubation for 60 min with biotinylated goat anti-rabbit 
IgG (Vector labs, PK6101) at 1:200 dilution. The detection was performed with 
the DAB-MAP kit (Ventana Medical Systems). The entire slides were scanned 
by Zeiss Mirax Scan (Carl Zeiss) using a X20/0.8 objective. The scanned 
image was exported into image analysis software, Metamorph (Molecular 
Devices). The colour threshold for DAB-positive nuclei was determined 
and set for all images. Areas above the threshold for the DAB signal and 
for haematoxylin-counterstained total nuclei were measured in an automated 
fashion. The ratio between the two parameters were calculated and analysed for 
statistical significance. 

Synthesis of 1-octyl-p-2-hydroxyglutarate:. Commercial R(-)-tetrahydro-5- 
oxofuran-2-carboxylic acid (140 mg, 1.076 mmol) was dissolved in H,O (1 ml), 
cooled to 0°C and treated with 1N KOH (2.16 ml, 2.15 mmol). The resulting 
solution was stirred at this temperature for 5 min and at ambient temperature 
for 2 h. It was then concentrated to dryness under reduced pressure and dried. The 
residue was dissolved in trifluoroacetic anhydride (8 ml) at 0 °C, stirred for 30 min 
at 0°C, for 2h at room temperature, then the volatiles were evaporated under 
reduced pressure. The residue was dried and dissolved in anhydrous tetrahydrofuran 
(6 ml). Octanol (0.3 ml, 2.1 eq.) was added to the solution at 0 °C and the mixture 
was stirred for an overnight period at ambient temperature. Water was added to 
quench the reaction, and the mixture extracted with EtOAc. The combined extracts 
were dried over MgSO,, concentrated and purified by Flash chromatography 
(EtOAc:hexane 1:3 and 1:1) to give 1-octyl-b-2-hydroxyglutarate (110 mg, 39%). 
Neurosphere isolation, culture and differentiation. Six days postpartum Ink4a/ 
Arf null (p16/p19 ‘~) mice were killed, with the isolated subventricular zones 
subjected to chemical (Pronase E, Calbiochem 7433-2) and mechanical dissoci- 
ation to obtain a single-cell suspension in full neurobasal medium (Neurobasal 
medium, GIBCO 21103; B27 supplement without retinoic acid, GIBCO 12587-010; 


Glutamax, GIBCO 35050; 20ngml~' EGF, R&D Systems 236-EG; 20ngml * 
basic FGF, Millipore GF003). On the next day the cells were spun down and 
re-suspended in fresh medium, and once neurospheres had formed in culture, 
the spheres were collected and chemically dissociated (Accumax, Innovative Cell 
Technologies AM105) back into single cells in fresh medium. 

One day after final infection, infected neurospheres and a non-infected control 
were placed in 400 pg ml’ Hygromycin B (InvivoGen, ant-hg-1). Once selection 
was complete, isogenic cell lines maintained in full neurobasal medium were 
chemically dissociated into single cells and plated at the same density in full 
neurobasal medium with increasing concentrations of retinoic acid (Sigma- 
Aldrich, R2625). Seventy-two hours later cells were harvested, and expression of 
proteins was analysed by western blotting. 

Measurement of total CpG methylation. DNA methylation was assessed as 
previously described*’. In brief, 1 X 10° NHA cells were washed with PBS and 
fixed with 2% paraformaldehyde for 10 min at room temperature and permeabilized 
with 0.5% Triton X-100 for 10 min. Cells were then treated with 2 N HCl for 20 min 
at room temperature and subsequently neutralized with 100 mM Tris-HCl, pH 8.0. 
Cells were incubated with anti-5-methylcytosine antibody (Calbiochem, NA 81) at 
1:100 dilution for 30 min at room temperature. After washing with PBS, cells were 
incubated with secondary antibody coupled with ALEXA FLUOR 488 (Invitrogen) 
for 30 min in the dark. Flow cytometry was done using Becton Dickinson Calibur 
flow cytometer and analysed using FlowJo software. 

Statistical analysis. All statistical analysis was performed using Student’s t-test 
(two-sample equal variance; two-tailed distribution). 
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IDH1 mutation is sufficient to establish the glioma 
hypermethylator phenotype 
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Both genome-wide genetic and epigenetic alterations are fun- 
damentally important for the development of cancers, but the 
interdependence of these aberrations is poorly understood. 
Glioblastomas and other cancers with the CpG island methylator 
phenotype (CIMP) constitute a subset of tumours with extensive 
epigenomic aberrations and a distinct biology’*. Glioma CIMP 
(G-CIMP) is a powerful determinant of tumour pathogenicity, 
but the molecular basis of G-CIMP remains unresolved. Here we show 
that mutation of a single gene, isocitrate dehydrogenase 1 (IDH1), 
establishes G-CIMP by remodelling the methylome. This remodelling 
results in reorganization of the methylome and transcriptome. 
Examination of the epigenome of a large set of intermediate- 
grade gliomas demonstrates a distinct G-CIMP phenotype that is 
highly dependent on the presence of IDH mutation. Introduction 
of mutant IDH1 into primary human astrocytes alters specific 
histone marks, induces extensive DNA hypermethylation, and 
reshapes the methylome in a fashion that mirrors the changes 
observed in G-CIMP-positive lower-grade gliomas. Furthermore, 
the epigenomic alterations resulting from mutant IDH1 activate 
key gene expression programs, characterize G-CIMP-positive 
proneural glioblastomas but not other glioblastomas, and are 
predictive of improved survival. Our findings demonstrate that 
IDH mutation is the molecular basis of CIMP in gliomas, provide 
a framework for understanding oncogenesis in these gliomas, and 
highlight the interplay between genomic and epigenomic changes in 
human cancers. 

The isocitrate dehydrogenase genes [DH1 and IDH2 are mutated in 
>70% of lower-grade gliomas (grades II and III), in some glioblastomas*”, 
and in leukaemias and several other cancers®’. The most common IDH1 
mutations in glioma (>95%) result in an amino acid substitution at 
arginine 132 (R132), which resides in the enzyme’s active site. Mutation 
of IDH imparts the ability to produce 2-hydroxyglutarate (2-HG), a 
potential oncometabolite*’. Alterations in the methylation landscape 
have been shown to have important roles during oncogenesis''. CIMP 
has emerged as a distinct molecular subclass of tumours in a number of 
human malignancies, including glioblastoma'*. This phenotype is 
associated with extensive, coordinated hypermethylation at specific 
loci'*"*"*. In glioblastomas, G-CIMP is associated with the proneural 
subgroup of tumours and IDH mutation’. Exactly how mutant IDH 
promotes tumorigenesis and causes G-CIMP—or CIMP in any type of 
human cancer—is unknown. 

To determine whether IDH1 mutation directly causes G-CIMP, we 
used immortalized primary human astrocytes'’* and constructed 
isogenic cells expressing either mutant IDH1 (R132H), wild-type IDH1, 
or neither. These astrocytes are well characterized’*’. Introduction of 


wild-type IDH1 and the R132H IDH1 mutant resulted in equal expres- 
sion of protein (modest threefold increase) (Fig. 1a). Expression of 
mutant but not wild-type IDH1 in human astrocytes resulted in the 
production of 2-HG (Fig. 1b). To determine whether mutant IDH1 
altered the methylation landscape, we analysed genomic DNA from 
these cells using the Illumina Infinitum HumanMethylation450 platform. 
The platform provides genome-wide coverage and is both well 
validated and highly reproducible’*”. 

Previous data demonstrated that de novo DNA methylation in 
in vitro models occurs over extended periods, requiring time to ‘lock 
in’ epigenomic changes'*”°. We thus analysed the methylomes of 
astrocytes expressing mutant or wild-type IDH1 over successive 
passages (up to 50). Analysis using self-organizing maps demonstrated 
that mutant IDH1 progressively remodelled the glial methylome over 
time (Fig. 1c, d), an effect that was not seen in control astrocytes. 
Expression of mutant IDH1 caused a marked increase in hypermethy- 
lation at a large number of genes, although there was a small group of 
hypomethylated genes as well (Fig. le and Supplementary Fig. 1a and 
Supplementary Table 1). Surprisingly, expression of wild-type IDH1 
also reshaped the methylome but in a manner that differed from effects 
due to expression of mutant IDH1 (Fig. 1f). Expression of wild-type 
IDH1 caused hypomethylation at specific loci, suggesting that both the 
production of 2-HG and the levels of «-ketoglutarate can affect the 
methylome. Unsupervised hierarchical clustering of the methylome 
data showed that the hypermethylated genes included both genes that 
underwent de novo methylation as well as genes that originally 
possessed low levels of methylation but subsequently acquired high 
levels of methylation (Fig. le). Control astrocytes did not undergo 
these methylome changes (Fig. 1c, d). Mutant IDH1-induced remodel- 
ling of the methylome was progressive and reproducible, and resulted 
in significant changes in gene expression (Fig. 1f and Supplementary 
Fig. 1a, Supplementary Tables 2 and 3). 

We sought to define the methylation targets of mutant IDH in 
astrocytes. Of the 44,334 CpG sites that were differentially methylated 
in mutant IDH-expressing cells, 30,988 sites were hypermethylated 
(3,141 unique genes with promoter CpG island methylation changes; 
Supplementary Table 1). Transcriptional module mapping showed 
that the genes undergoing methylation changes were highly enriched 
for polycomb complex 2 (PRC2)-targeted loci (Supplementary Fig. 1b 
and Supplementary Table 4)'**". These observations demonstrate that 
mutant IDH1 is sufficient to reshape the epigenome by altering the 
global methylation landscape. 

Lower-grade gliomas (LGGs; World Health Organization grades II 
and II) and secondary glioblastomas are biologically distinct from 
primary or de novo glioblastomas’’. Present knowledge of G-CIMP 
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Figure 1 | Introduction of mutant IDH1 into human astrocytes remodels 
the methylome. a, Expression of wild-type and mutant IDH1 (R132H) in 
immortalized human astrocytes (passage 5). b, Overexpression of mutant 
IDH1 but not wild-type (WT) IDH1 in human astrocytes leads to production of 
2-HG*. Error bars show 1 standard deviation (s.d.) (n = 2). ¢, Self-organizing 
map (SOM) analysis of methylome data for wild-type IDH1-expressing, 
mutant IDH1-expressing (R132H), and parental (control) cell lines shows 
changes in the methylome in mutant IDH1-expressing and wild-type IDH1- 
expressing astrocytes, compared to parental cells. Mosaic patterns are pseudo- 
coloured SOMs from different time points (P indicates passage number). Tile 
colours indicate methylation level of centroids. d, Hierarchical clustering 
showing divergence of the methylome of IDH1-expressing astrocytes from that 
of parental astrocytes. MUT, mutant; PAR, parental. e, Heatmap showing the 
10,678 most significant differentially methylated probes (ANOVA) in IDH1 
mutant astrocytes and parental astrocytes (passages 2 and 40). Colour scale 
indicates f values. f, Kinetics of differential methylation in mutant and wild- 
type-expressing astrocytes. Error bars indicate inter-quartile range (n = 2). 


is based on the examination of primary glioblastomas in which IDH 
mutations are infrequent'*”. To determine the impact of IDH muta- 
tion on the methylation landscape in primary LGGs, we generated a 
high-resolution, genome-wide set of LGG methylome data from 
patients with complete clinical follow-up using the same Infinium 
450K platform as described earlier (72 WHO grade II and III gliomas; 
Fig. 2 and Supplementary Table 5). We first performed consensus 
clustering (Fig. 2a and Supplementary Fig. 2a) and unsupervised 
hierarchical clustering (Fig. 2b and Supplementary Fig. 2b) to identify 
LGG subgroups. We identified two robust DNA methylation clusters, 
one encompassing tumours with markedly high methylation levels 
(cluster 2) and another without the hypermethylated loci (cluster 1). 
Cluster 2 tumours demonstrated a characteristic DNA methylation 
profile with high-coordinate cancer-specific methylation at a subset of 
loci, concordant with the G-CIMP phenotype defined in glioblastomas 
(Supplementary Fig. 2b and Supplementary Table 6)'. The composition 
of the G-CIMP group in these LGGs was confirmed by two indepen- 
dent clustering methods (K-means consensus and two-dimensional 


480 | NATURE | VOL 483 | 22 MARCH 2012 


Lorenz curve 


1.0 
+ f 
als 0.8 | f 
Ofg 0.6 if 
OO} jf 
BL jf = 
a 0.4 — 2 clusters, Gini = 0.46 
iz Vf —3clusters, Gini = 0.55 
oe 0.2 /, ~ 4clusters, Gini = 0.60 
= 2 —5 clusters, Gini = 0.63 
oF 0 Zz 
On 0 02 04 06 08 1.0 
P 
b 2D hierarchical clustering 
DNA methylation probes 
CIMP. 
7 @CIMP+ 
& @ CIMP- 
4 IDH2 status 
o| @R172K 
FS @ R172 wild type 
= IDH1 status 
2 @R132H 
0) @R132L 
@ R132 wild type 
= 
0 0.5 1 
c 1.0, 
2 0. 0.84 
ral CIMP 
20. Zo] — MGMT 
a = methylation 
= 0. 2 — MGMT 
> B 0.44 : 
s expression 
5 0. Variable Area 
“ 0.24 CIMP 0.84 
o4 P<0.001 Methylation 0.71 
a E ion 0.43 
02 4 6 8 10 12 —— eee 
Time (years) 0 0.2 04 06 08 1.0 


Specificity 


Figure 2 | Global epigenetic analysis of LGGs reveals dependence of 
G-CIMP on IDH mutation. a, Identification of G-CIMP by K-means 
consensus clustering of LGG samples. Unsupervised clustering was performed 
with the most variant probes (9,711 probes, top 2%). Tumours are listed in the 
same order along the x and y axes. G-CIMP status is indicated by the black and 
white bars. Consensus index values range from 0 to 1, with 0 being dissimilar 
(white) and 1 being similar (red). K = 2 is identified by the Lorenz curve. 

b, Two-dimensional (2D) hierarchical clustering of the same probes as in 

a identified the same two clusters. Each row represents a tumour and each 
column represents a probe. CIMP and IDH mutation status are indicated by the 
colour code. The level of DNA methylation (f value) for each probe is 
represented by colour scale (red, methylated; blue, non-methylated). Only 
cancer-specific events were used”’. c, Kaplan-Meier survival curve of Memorial 
Sloan-Kettering Cancer Center (MSKCC) patients (n = 72) with LGG (grade II 
and III). d, Receiver operating characteristic (ROC) curve comparing the 
sensitivity and specificity of G-CIMP status compared with MGMT 
methylation or MGMT expression status, in LGGs. Areas under the curve are 
noted in the inset. G-CIMP, MGMT methylation and MGMT expression were 
determined as described in Methods. 


hierarchical clustering) (Fig. 2a, b). Probes defining CIMP in LGGs 
included those in CpG islands and shores (Supplementary Fig. 2c, d) 
and were enriched for PRC2-target genes (Supplementary Table 7). 
Global expression profiles showed that G-CIMP+ tumours possessed 
markedly different transcriptional profiles than G-CIMP— tumours 
(Supplementary Tables 8 and 9). EpiTYPER (Sequenom) mass 
spectrometry was used to validate the methylation status of loci in both 
the astrocyte model and in the tumours (Supplementary Fig. 2e-g)”’. 
To determine the mutational status of IDH1 and IDH2, we sequenced 
the entire coding sequence of the two genes in all the samples above 
(Fig. 2b). Ninety-eight per cent (49/50) of the G-CIMP+ tumours 
possessed either an IDH1 mutation or IDH2 mutation. Notably, none 
of the G-CIMP— tumours possessed mutant IDH (Supplementary 
Fig. 2h). These genomic data show that G-CIMP is highly dependent 
on the presence of IDH mutation and, in LGGs that are CIMP—, IDH 
mutations do not occur (0%). Currently, the methylation status of 
O-6-methylguanine DNA methyltransferase (MGMT) is a widely 
used molecular biomarker for glioblastoma prognosis and response to 


©2012 Macmillan Publishers Limited. All rights reserved 


temozolomide™*. In LGGs, G-CIMP associated with markedly better 
clinical endpoints (Fig. 2c and Supplementary Figs 3-6, Supplemen- 
tary Tables 10 and 11). Importantly, G-CIMP was significantly superior 
to MGMT methylation or MGMT messenger RNA expression as a 
predictor of survival (Fig. 2d). 

We next sought to define the nature of the methylome differences 
between IDH mutant and wild-type tumours and characterize the 
effects of these differences on the LGG transcriptome. Figure 3a shows 
a principal component analysis (PCA) of methylome and expression 
data from our tumours. PCA shows that G-CIMP+ and G-CIMP— 
LGGs methylome subgroups correlate with marked transcriptome 
differences (Fig. 3a). Of the 140,016 sites that were differentially 
methylated between IDH mutant and wild-type tumours, 121,660 
were hypermethylated (Supplementary Table 6). There were 2,611 
unique genes with alterations in promoter CpG islands represented 
in this group. Consistent with the results in Fig. 2b, a volcano plot 
showing differentially methylated genes between G-CIMP+ and 
G-CIMP— tumours was highly asymmetric (Fig. 3b). A starburst plot 
showing the relationship between DNA methylation and expression is 
shown in Fig. 3b. Integration of the normalized gene expression and 
DNA methylation gene sets identified 429 genes with both significant 
hypermethylation and downregulation and 176 genes that were 
hypomethylated and upregulated in G-CIMP+ LGGs (Supplemen- 
tary Table 12). Among these genes are those known to be involved 
in glioma initiation and outcome, including CDKN2C and GAP43 
(refs 25, 26). 

As a critical experiment to prove causality between IDH1 mutation 
and G-CIMP, we performed an in-depth comparison of methylation 
marks and gene expression alterations between human astrocytes 
expressing mutant IDH1 and the LGGs with endogenous IDH1 muta- 
tion. We first focused on the comparison of methylation marks and 
found that both sets of methylome alterations targeted similar loci. 
Gene set enrichment analysis (GSEA) of the mutant IDH1-induced 
methylation changes in the isogenic astrocyte system (Fig. 1) and the 
G-CIMP genes demonstrated very significant enrichment and con- 
cordance (Fig. 3c and Supplementary Table 13 and Supplementary 
Fig. 7). Importantly, the genes that were methylated after mutant 
IDH1 expression correctly classified LGG tumours into CIMP+ or 
CIMP— groups with very high accuracy (Fig. 3d and Supplementary 
Table 14). To confirm the impact of these alterations on glioma patho- 
biology, we used the transcriptomic footprint of mutant IDH to generate 
an expression signature (mutant IDH repression signature) composed 
of the most significantly methylated and downregulated genes in both 
the isogenic astrocyte system and the G-CIMP gene set (17 genes; 
Supplementary Table 15). As expected, this signature classified an inde- 
pendent LGG cohort (Rembrandt) into two distinct subgroups (Fig. 3e 
and Supplementary Figs 8-10 and Supplementary Table 16). Together, 
our findings show that introduction of mutant IDH reprograms the 
epigenome and generates the foundations of G-CIMP. 

IDH mutation is highly enriched in the CIMP+, proneural subgroup 
of glioblastomas. Using data from The Cancer Genome Atlas (TCGA), 
we applied the mutant IDH repression signature as a classifier to the 
transcriptomes of all four subgroups of glioblastomas”’. The signature 
segregated IDH mutant and wild-type proneural glioblastomas into two 
distinct subgroups associated with very different prognoses, but did not 
do so in other glioblastoma subgroups (Supplementary Fig. 11a, b). 
These data demonstrate that mutant IDH-induced epigenomic altera- 
tions have profound biological implications within the proneural class 
of glioblastomas that are specific for this subclass. Comparison of gene 
expression programs that occur in astrocytes expressing mutant IDH1 
to those in LGG tumours that harbour the IDH mutation showed 
remarkable similarity (Fig. 4a and Supplementary Fig. 12). Moreover, 
introduction of mutant but not wild-type IDH1 into astrocytes resulted 
in the upregulation of nestin (and other genes associated with stem cell 
identity) at the time of DNA methylation increase and the adoption 
of a neurosphere/stem-like phenotype (Fig. 4b and Supplementary 
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Figure 3 | IDH1 mutation directly generates the methylation patterns 
present in G-CIMP tumours. a, The methylomes and transcriptomes of LGGs 
are distinct. PCA plot of LGG tumours for all methylation probes (left) and 
expression probes (right) (n = 52). PC, principal component. b, Starburst plot 
(left) for comparison of DNA methylation and gene expression. The logy 
(FDR-corrected P value) is plotted for / value for DNA methylation (x axis) and 
gene expression (y axis) for each gene. Black dotted line shows the FDR- 
adjusted P value of 0.05. Red points indicate downregulated and 
hypermethylated genes in G-CIMP+ LGGs versus G-CIMP— LGGs. Blue 
points show hypomethylated and upregulated genes. Volcano (right) plot of all 
CpG loci analysed for G-CIMP association. The /-value difference in DNA 
methylation between G-CIMP+ and G-CIMP— tumours is plotted along the x 
axis. The P value between G-CIMP+ and G-CIMP— tumours is plotted on the 
y axis (—log)o scale). Red indicates significantly different probes. 

c, Concordance between hypermethylated sites in mutant IDH1-expressing 
astrocytes and G-CIMP+ LGGs. GSEA shows significant enrichment between 
730 hypermethylated unique CpG sites identified in IDH1 mutant astrocytes 
(ANOVA between passage 2 and 40) and those present in G-CIMP+ gliomas. 
GSEA correlation shown in colour scale. ES, enrichment score; FDR, false 
discovery rate; FEWR, familywise error rate; NES, normalized enrichment 
score; NOM, nominal P value. d, Differential methylation in IDH mutant 
astrocytes correctly classifies G-CIMP in the human LGGs. Two-dimensional- 
unsupervised hierarchical clustering of 81 human gliomas with top variant 
probes (n = 10,000) from mutant IDH1 astrocytes. Tumours are shown on the 
y axis, probes along the x axis. Methylation (f value) for each probe is 
represented with the colour scale. G-CIMP classification as determined by the 
astrocyte-derived data is denoted by the colour bars at the left. e, Kaplan-Meier 
survival curve of 115 patients with grade II or grade III gliomas in the 
Rembrandt Database grouped by CIMP status. P value calculated by log rank. 
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Figure 4 | Functional implications of IDH1-mutation-induced alterations 
in the glioma epigenome. a, Concordance of transcriptional programs 
regulated by mutant IDH1 in astrocytes and G-CIMP in LGGs. P value for 
significance is shown along the x axis. Yellow lines indicate threshold of 
significance (P = 0.05). b, Mutant IDH1 results in the expression of markers of 
self-renewal and stem cell identity. Left, mutant IDH1 results in expression of 
nestin. P indicates passage number. Right, expression of mutant IDH1 
promotes the adoption of a neurosphere phenotype. Astrocytes (passage 15) 
that express IDH R132H or IDH1 wild type were used in the neurosphere assay. 
Error bars indicate 1 s.d. **P < 0.01 (ttest). c, Alterations in histone marks in 
IDH1-mutant-expressing human astrocytes. Left, western blot results are 
shown using the indicated antibodies. Astrocytes are from passage 27. Right, 
ChIP of the indicated histone marks for representative hypermethylated genes. 
Error bars indicate 1 s.d. *P < 0.05. d, Mutant IDH1 inhibits the production of 
5hmC in human astrocytes. Left, mutant IDH inhibits TET2-dependent 5hmC 
production in astrocytes. Parental astrocytes were infected with lentivirus 
directing the expression of TET2 catalytic domain and green fluorescent 
protein (GFP) + mutant IDH1. FACS analyses are shown for 5hmC. Right, 
astrocytes expressing IDH R132H (passage 10) have less ShmC than astrocytes 
that do not express the IDH mutant. 


Fig. 13)’*. These data suggest that mutant IDH1 functions by interfer- 
ing with differentiation state. 

Our data show that IDH1 mutation is the mechanistic cause of 
G-CIMP. To gain further insight, we determined the effects of mutant 
IDHI1 on histone alterations in our astrocyte system. Figure 4c (left) 
shows that expression of the IDH1 mutant increases levels of H3K9me2, 
H3K27me3 and H3K36me3, consistent with previous findings”. 
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Chromatin immunoprecipitation (ChIP) experiments examining 
representative genes that undergo hypermethylation show H3K9 and 
H3K27 methylation are both enriched in cells expressing mutant IDH1 
(Fig. 4c, right). As both of these marks can promote DNA methylation, 
alterations in histone marks may contribute to the accumulation of 
DNA methylation. 

Next, we determined the effects the mutation had on TET2-dependent 
5-hydroxymethylcytosine (ShmC) levels. We used a well-established 
assay””’ and first confirmed that we were able to detect TET-dependent 
alterations in 5hmC (Supplementary Fig. 14). We found that expression 
of the IDH1 mutant in astrocytes resulted in a significant decrease in 
5hmC (Fig. 4d, right). Expression of TET2 in the astrocytes produced 
5hmC, which was inhibited by mutant but not wild-type IDH1 (Fig. 4d, 
left). Because TET-mediated production of 5hmC is a primary mode of 
DNA demethylation”®, inhibition of this activity in the IDH1-mutant- 
expressing astrocytes may be a mechanistic basis for accumulation of 
DNA methylation, ultimately leading to a CIMP pattern. 

IDH mutation and the CIMP phenotype are two very common 
features in cancer, the underlying mechanisms for which are obscure. 
The fundamental questions regarding these features are (1) how the 
IDH mutation contributes to oncogenesis, and (2) what the root cause 
of CIMP is. Our data address these important questions by demon- 
strating that IDH mutation is the cause of CIMP and leads to the CIMP 
phenotype by stably reshaping the epigenome. This remodelling 
involves modulating patterns of methylation on a genome-wide scale, 
changing transcriptional programs and altering the differentiation 
state. Our observations suggest that the activity of IDH may form 
the basis of an ‘epigenomic rheostat’, linking alterations in cellular 
metabolism to the epigenetic state. In summary, these data provide a 
mechanistic framework for how IDH mutation leads to oncogenesis 
and the molecular basis of CIMP in gliomas. We believe our observa- 
tions have critical implications for the understanding of gliomas and 
the development of novel therapies for this disease. 


METHODS SUMMARY 


Cell culture. Immortalized human astrocytes were a gift from R. O. Pieper 
(University of California, San Francisco) and were prepared as previously 
described"*. Cells were cultured in Dulbecco’s modified Eagle’s medium 
(DMEM) plus 10% fetal bovine serum (FBS; Invitrogen). Expression of IDH 
was accomplished by cloning wild-type or mutant IDH1 (R132H) into the vector 
pLNCX2. These constructs were used to construct lentiviruses used for infection of 
target cells. Selection was performed using G418. All experiments were performed 
in duplicate. 

Tumours. All tumours (” = 81) were obtained following surgical resection at the 
MSKCC as part of routine clinical care and snap frozen. Tumours were obtained in 
accordance with Institutional Review Board policies at the MSKCC. Each sample 
was examined histologically with haematoxylin-and-eosin-stained cryostat sections 
bya neuropathologist. Before analysis, tumours were sectioned and microdissected. 
Genomic DNA or RNA was extracted using the DNeasy kit (Qiagen) or RNeasy 
Lipid Tissue Mini kit (Qiagen) per the manufacturer’s instructions. 

Genomic analysis. Expression analysis of astrocytes and tumours was performed 
using the Affymetrix U133 2.0 microarray. Genome-wide methylation analysis 
was performed using the Illumina Infinitum HumanMethylation450 bead array. 
Processing of the arrays was as per the manufacturer’s protocol. Methylation data 
were extracted using GenomeStudio software (Illumina). Methylation values for 
each site are expressed as a (3 value, representing a continuous measurement from 
0 (completely unmethylated) to 1 (completely methylated). This value is based on 
the following calculation: / value = (signal intensity of methylation-detection 
probe) / (signal intensity of methylation- detection probe + signal intensity of 
non-methylation detection probe). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Cell culture. Immortalized human astrocytes were a gift from R. O. Pieper 
(University of California, San Francisco) and were prepared as previously 
described'*. Cells were cultured in Dulbecco’s modified Eagle’s medium 
(DMEM) plus 10% fetal bovine serum (FBS; Invitrogen). Expression of IDH 
was accomplished by cloning wild-type or mutant IDH1 (R132H) into the vector 
pLNCX2. These constructs were used to construct retroviruses used for infection 
of target cells. The retroviral packaging cell line GP-293 was seeded in 10-cm- 
diameter dishes and (at 30-50% confluency) was transfected using Lipofectamine 
(Invitrogen) with pVSV-G (Clontech) and pLNCX2-IDH1 wild type or IDH1 
R132H. Retroviral particles were collected, filtered through a 0.45-um syringe 
filter and polybrene was added (8 jig ml final concentration) to infect the human 
astrocytes for 12 h. Stable transfectants were selected with G418 and pooled popu- 
lations of G418-resistant cells expressing either wild-type IDH1 or IDH1 R132H 
were confirmed by western blot analysis with anti-IDH1 antibody (rabbit anti- 
IDH1; Cell Signaling). All experiments were performed in duplicate. 

Tumours. All tumours (1 = 81) were obtained following surgical resection at the 
MSKCC as part of routine clinical care, and snap frozen. Tumours were obtained 
in accordance with Institutional Review Board policies at the MSKCC. Each 
sample was examined histologically by a neuropathologist. Before analysis, 
tumours were sectioned and microdissected. Genomic DNA or RNA was 
extracted using the DNeasy kit (Qiagen) or Triazol (Invitrogen) as per the 
manufacturer’s instructions. Data from TCGA tumours (n = 173) are publically 
available”. For the LGG validation set, expression data sets of 115 patients with 
grade II and grade III gliomas were identified from the NCI Repository for 
Molecular Brain Neoplasia Data (Rembrandt; http://rembrandt.nci.nih.gov). 
Sample preparation. DNA from wild-type IDH1, R132H IDH1 and parental 
astrocytes was extracted with the Puregene Cell and Tissue Kit (Qiagen) at various 
passages (passages 2, 5, 10, 15, 20, 25, 30, 40 and 50) and RNA was extracted with 
Trizol (Invitrogen) according to the manufacturer’s directions. All experiments 
with the astrocytes were performed in duplicate, each with two corresponding 
technical (microarray) replicates. Genomic DNA and RNA from human tumours 
were extracted from frozen primary tumours for the methylation and expression 
studies. Frozen samples were snap frozen in liquid nitrogen and stored at —80 °C. 
Each sample was examined histologically with haematoxylin-and-eosin-stained 
sections by a neuropathologist and representative sections were microdissected 
from the slides. Genomic DNA was extracted with the Qiagen DNeasy Blood and 
Tissue Kit using the manufacturer’s instructions. RNA was extracted with Qiagen 
RNeasy Lipid Tissue Mini Kit using the manufacturer’s instructions. Nucleic acid 
quality was determined with the Agilent 2100 Bioanalyzer. 

Genomic analysis. Expression analysis of astrocytes and tumours was performed 
using the Affymetrix U133 2.0 microarray (Affymetrix). Genome-wide methylation 
analysis was performed using the Infinitum HumanMethylation450 bead array 
(Illumina). Processing of the arrays was per the manufacturer’s protocol. Methylation 
data were extracted using GenomeStudio software (Illumina). Methylation values for 
each site are expressed as a 3 value, representing a continuous measurement from 0 
(completely unmethylated) to 1 (completely methylated). This value is based on 
following calculation: / value = (signal intensity of methylation-detection probe)/ 
(signal intensity of methylation-detection probe + signal intensity of non- 
methylation detection probe). 

Data analysis. For methylation analysis, [lumina data were imported into Partek 
software. f Values were logit-transformed and adjusted for batch effects before 
analysis. Analysis of variance (ANOVA) with false discovery correction (FDR) was 
used to identify genes that were differentially methylated between the astrocytes 
expressing wild-type IDH1, mutant IDH1, and control astrocytes. Significant 
changes were defined as genes having an FDR-corrected P value < 0.05. In human 
tumours, unsupervised consensus clustering of the / values was performed with 
K-means clustering (Kax = 5) with Euclidean distance and average linkage over 
1,000 resampling iterations with random restart on the top 2% of the most variant 
probes (9,750 probes) using Gene Pattern v.2.0°'. This identified an optimal number 
of K = 2 groups. This was repeated using unsupervised hierarchical clustering using 
Pearson dissimilarity. The cluster of samples that exhibited a large degree of 
hypermethylation was identified as CIMP+, and the remaining group CIMP—. 
ANOVA with FDR correction was used to identify genes that were differentially 
methylated between the CIMP groups. Significant changes were defined as genes 
having an FDR-corrected P value < 0.05. 

For gene expression analysis of astrocytes, Affymetrix CEL files were imported 
into the R statistical software (v.2.13.0; http://www.R-project.org). Normalization 
was performed with the AffyPLM package in BioConductor (v.2.4), using RMA 
background correction, quantile normalization, and the Tukey biweight summary 
method. Differential expression was detected using the limma package and P 
values were adjusted for multiple testing using the FDR approach. A probe set 
is considered differentially expressed if the FDR-adjusted P value < 0.05. For gene 


expression analysis of the human tumours, the Affymetrix data were imported into 
the Partek Genomics Suite (Partek) as Affymetrix CEL files. The data were RMA 
normalized and median-scaled for analysis. ANOVA followed by FDR was used to 
identify genes that were differentially expressed between the CIMP groups. To 
derive the 17-gene mutant IDH1 repression signature, we identified 605 unique 
genes that had either statistically significant hypermethylation at promoter- 
associated CpG islands and decreased gene expression, or had hypomethylation 
at promoter-associated CpG Islands and increased gene expression in CIMP+ 
versus CIMP— tumours. We identified common genes in a comparison of this 
gene set with that derived from mutant IDH1-expressing astrocytes versus wild 
type. Differential methylation in the cell lines was defined as an FDR-adjusted 
P value < 0.05, and differential expression was defined as a P value < 0.05 with 
concordant fold change of at least 1.5 fold. 

The 17-gene expression signature was used to predict CIMP in the Rembrandt 
data set. Unsupervised hierarchical clustering using Pearson dissimilarity identified 
two unique clusters that were categorized as ‘predicted CIMP+’ and ‘predicted 
CIMP—’. The 17-gene expression signature was also used to identify subgroups 
from the TCGA GBM data set of 173 patients’. Unsupervised consensus clustering 
using Pearson dissimilarity was performed on each of the subclasses identified by 
the TCGA to identify clusters**. Rembrandt data sets were obtained at http:// 
caintegrator-info.nci.nih.gov/rembrandt. 

Functional analysis of gene lists was performed using the PANTHER database and 
categories with adjusted P values (Benjamini-Hochberg) < 0.05 were considered as 
significantly over-represented in our gene lists’. Concepts module mapping was 
performed as follows. The hypermethylation signature identified from our analysis 
of differentially methylated genes in IDH1 mutants compared to IDH1 wild-type 
was imported into Oncomine (http://www.oncomine.org) to identify associations 
with molecular concepts signatures derived from independent cancer profiling stud- 
ies. Statistically significant concordances of our methylation gene signature with the 
pre-defined concepts were identified and Q value was calculated as previously 
described**. 

Methylation data of parental, wild-type IDH1-expressing astrocytes, and 
mutant IDH1-expressing astrocytes were clustered using self-organizing maps 
and visualized with the Gene Expression Dynamics Inspector (GEDI; v.2.1). For 
GEDI analysis, methylation data were normalized as a group across all passages 
and genotypes. Further hierarchical clustering (average-linkage) of GEDI map 
centroids was performed in R using the hclust library in the stats package. 
GSEA was performed using GSEA software v.2.0 and MSigDB database v.2.5. 
We assessed the significance of the curated gene sets (MSigDB collection 2) 
with the following parameters: number of permutations= 1,000 and 
permutation_type = phenotype, with an FDR Q-value cut-off of 5% (ref. 35). 

G-CIMP was compared to MGMT methylation and MGMT expression in 52 
LGG samples in the MSKCC cohort. We identified the Illumina 450K methylation 
probe ID (cg12981137) that corresponded to the MGMT MSP primer sequence as 
identified previously** and the Affymetrix probe ID (204880_at) that corresponds 
to MGMT expression. 

Clinical and pathological characteristics between cohorts were compared using 

the 7° test. Overall survival was calculated from the date of surgery to death from 
any cause. Patients were censored at the time they were last known to be alive. 
Overall survival was assessed using the Kaplan-Meier method and the log-rank 
test was used for comparison between groups. Multivariate analysis was performed 
using a Cox proportional hazards model to assess the independent effect of 
prognostic variables on outcome, and using binary logistic regression to predict 
the probability of occurrence of CIMP+. An ROC curve was generated to graph 
the sensitivity and specificity of CIMP, MGMT methylation, and MGMT expres- 
sion to predict survival =3 years. MGMT methylation and MGMT expression was 
considered continuous variables, and CIMP a categorical variable (defined by 
unsupervised hierarchical analysis as described above). Patients that were alive 
and had less than 3 years of follow-up were excluded from this analysis. Data was 
analysed using SPSS software (IBM SPSS statistics version 19.0). 
Quantitative DNA methylation analysis using mass spectrometry. DNA 
methylation analysis was performed using the EpiTYPER system (Sequenom). 
The EpiTYPER assay is a tool for the detection and quantitative analysis of 
DNA methylation using base-specific cleavage of bisulphite-treated DNA and 
matrix-assisted laser desorption/ionization time-of-flight mass spectrometry 
(MALDI-TOF MS)”. For primer sequences, target chromosomal sequence, and 
EpiTYPER-specific tags, see Supplementary Table 17. SpectroCHIPs were ana- 
lysed using a Bruker Biflex III MALDI-TOF mass spectrometer (SpectroREADER, 
Sequenom). Results were analysed using the EpiTYPER Analyzer software, and 
manually inspected for spectra quality and peak quantification. CIMP positivity 
was defined as a mean methylated allelic frequency of >50% or a twofold increase 
over normal breast tissue and the CIMP- state. 
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PCR amplification and sequencing. Exonic regions for the [DH1 and IDH2 
genes (NCBI Human Genome Build 36.1) were broken into amplicons of 
500 bp or less, and specific primers were designed using Primer3. Standard M13 
tails were added to the primers to facilitate Sanger sequencing. PCR reactions were 
carried out in 384-well plates in a Duncan DT-24 water bath thermal cycler with 
10 ng of whole-genome amplified DNA (REPLI-g Midi, Qiagen) as a template, 
using a touchdown PCR protocol with KAPA Fast HotStart (Kapa Biosystems). 
The touchdown PCR method consisted of: 1 cycle of 95°C for 5 min; 3 cycles of 
95°C for 30s, 64 °C for 15 s, 72 °C for 30 s; 3 cycles of 95 °C for 30 s, 62 °C for 15s, 
72°C for 30s; 3 cycles of 95 °C for 30s, 60°C for 15s, 72°C for 30 s; 37 cycles of 
95 °C for 30s, 58 °C for 15s, 72 °C for 30s; 1 cycle of 70 °C for 5 min. Templates 
were purified using AMPure (Agencourt Biosciences). The purified PCR reactions 
were split into two and sequenced bidirectionally with M13 forward and reverse 
primer and the Big Dye Terminator Kit v.3.1 (Applied Biosystems) at Agencourt 
Biosciences. Dye terminators were removed using the CleanSEQ kit (Agencourt 
Biosciences), and sequence reactions were run on ABI PRISM 3730xl sequencing 
apparatus (Applied Biosystems). Sanger sequencing of IDH1 and IDH2 produced 
an average coverage of 96.1% of coding sequence nucleotides across all samples. 
Mutation detection. Passing reads were assembled against reference sequences, 
containing all coding exons including 5 kb upstream and downstream of the gene, 
using command line Consed 16.0°’. Assemblies were passed on to Polyphred 
6.02b**, which generated a list of putative candidate mutations, and to Polyscan 
3.0°, which generated a second list of putative mutations. The lists were merged 
together into a combined report, and the putative mutation calls were normalized 
to ‘+’ genomic coordinates and annotated using the Genomic Mutation 
Consequence Calculator**. The resulting list of annotated putative mutations 
was loaded into a Postgres database along with select assembly details for each 
mutation call (assembly position, coverage, and methods supporting mutation 
call). To reduce the number of false positives generated by the mutation detection 
software packages, only point mutations that were supported by at least one 
bi-directional read pair and at least one sample mutation called by Polyphred were 
considered, and only the putative mutations that were annotated as having non- 
synonymous coding effects, occurred within 1 bp of an exon boundary, or had a 
conservation score > 0.699 were included in the final candidate list. Indels were 
manually reviewed and included in the candidate list if found to hit an exon. All 
putative mutations were confirmed by a second PCR and sequencing reaction, in 
parallel with amplification and sequencing of matched normal tissue DNA. 
ChIP. Cells were fixed with 1% formaldehyde for 10 min at room temperature 
(21°C) and formaldehyde was inactivated by the addition of 125mM glycine. 
ChIP assays were performed using a protocol recommended by the manufacturer 
ofa commercially available ChIP assay kit (17-371, Millipore). Chromatin extracts 
were immunoprecipitated using anti-H3K9me3 (Ab8898, Abcam) or anti- 
H3K27me3 (07-449, Millipore) antibodies. After washing, ChIPed DNA was 
eluted from the beads and analysed on an Eppendorf Realplex using SYBR 
Green (Applied Biosystems). Relative occupancy values were calculated by deter- 
mining ratios of the amount of immunoprecipitated DNA to that of the input 
sample (2% of total). 
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Flow cytometry and 5hmC assay. HEK 293T cells were transiently transfected 
with Flag~TET2 in pCMV6-ENTRY vector with Lipofectamine 2000 (GIBCO). For 
two-colour flow cytometry, 10° cells were washed with ice-cold PBS, permeabilized 
and fixed using BD Cytoperm/Cytofix solution (BD, PharMingen), and incubated 
with anti-Flag (1:200, Sigma) and anti-ShmC (1:400, Active Motif #39770) 
antibodies for 30 min at room temperature. Cells were washed with PBS and 
incubated with secondary antibodies conjugated with Alexa Fluor 488 or Cy5 
(Invitrogen) for 30 min in the dark. For single-colour flow cytometry, parental 
and IDH1 mutant astrocytes were stained using anti-ShmC (1:400) followed by 
Alexa Fluor 488 secondary antibody. Cells were washed in PBS and analysed using 
the FACScan flow cytometer (Becton Dickinson). FACS data were analysed using 
FLowJo Software (TreeStar). 

Neurosphere assay. IDH1(R123H)-expressing astrocytes and parental controls 
were grown in media permissive of neural stem cell growth as previously 
described“’. Briefly, immortalized human astrocyte (IHA) cells stably expressing 
wild-type or R132H mutant IDH1 at passage 15 were seeded in 6-well plates at 
200,000 cells per well. The next day, proliferation medium (DMEM plus 10% FCS) 
was replaced with neural stem cell medium made from serum-free DMEM supple- 
mented with B27 and N2 supplements (all from Invitrogen), bFGF, EGF and 
PDGFAA (all at 20ngml', all from PeproTech). Medium was replaced every 
2-3 days. Neurospheres were quantified using microscopy. Experiments were per- 
formed in triplicate. 
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Transformation by the (R)-enantiomer of 
2-hydroxyglutarate linked to EGLN activation 


Peppi Koivunen'*, Sungwoo Lee**, Christopher G. Duncan’, Giselle Lopez’, Gang Lw’, Shakti Ramkissoon”*°, Julie A. Losman?, 
Paivi Joensuu®, Ulrich Bergmann’, Stefan Gross®, Jeremy Travins®, Samuel Weiss’, Ryan Looper!®, Keith L. Ligon?*°", 


Roel G. W. Verhaak!, Hai Yan® & William G. Kaelin Jr?° 


The identification of succinate dehydrogenase (SDH), fumarate 
hydratase (FH) and isocitrate dehydrogenase (IDH) mutations in 
human cancers has rekindled the idea that altered cellular metabolism 
can transform cells. Inactivating SDH and FH mutations cause the 
accumulation of succinate and fumarate, respectively, which can 
inhibit 2-oxoglutarate (2-OG)-dependent enzymes, including the 
EGIN prolyl 4-hydroxylases that mark the hypoxia inducible factor 
(HIF) transcription factor for polyubiquitylation and proteasomal 
degradation’. Inappropriate HIF activation is suspected of con- 
tributing to the pathogenesis of SDH-defective and FH-defective 
tumours but can suppress tumour growth in some other contexts. 
IDH1 and IDH2, which catalyse the interconversion of isocitrate 
and 2-OG, are frequently mutated in human brain tumours and 
leukaemias. The resulting mutants have the neomorphic ability to 
convert 2-OG to the (R)-enantiomer of 2-hydroxyglutarate ((R)- 
2HG)**. Here we show that (R)-2HG, but not (S)-2HG, stimulates 
EGLN activity, leading to diminished HIF levels, which enhances 
the proliferation and soft agar growth of human astrocytes. These 
findings define an enantiomer-specific mechanism by which the 
(R)-2HG that accumulates in IDH mutant brain tumours promotes 
transformation and provide a justification for exploring EGLN 
inhibition as a potential treatment strategy. 

To study the role of IDH mutations in brain tumours, we stably 
infected immortalized human astrocytes with retroviral vectors encod- 
ing haemagglutinin (HA)-tagged versions of wild-type IDH1, a tumour- 
derived mutant (IDH1 R132H)*”, or an IDH1 R132H variant in which 
three conserved aspartic acid residues within the IDH1 catalytic 
domain were replaced with asparagines (R132H/3DN) (Fig. la and 
Supplementary Fig. 1). As expected, (R)-2HG levels, but not (S)- 
2HG levels, were markedly increased in the cells producing IDH1 
R132H but not in cells producing the R132H/3DN variant (Fig. 1b 
and Supplementary Fig. 2). In multiple independent experiments the 
IDH1 R132H cells acquired a proliferative advantage relative to cells 
producing the other versions of IDH1 beginning around passage 14, 
manifested as increased proliferation at confluence (Fig. 1c) and the 
ability to form macroscopic colonies in soft agar (Fig. 1d, e). 

Consistent with recent reports, we found that both (R)-2HG and 
(S)-2HG inhibit a number of 2-OG-dependent enzymes in vitro*®, 
including the collagen prolyl 4-hydroxylases, the TET1 and TET2 
methyl cytosine hydroxylases, the HIF asparaginyl hydroxylase FIH1 
(also known as HIF1AN) and the JMJD2D (also known as KDM4D) 
histone demethylase, with (S)-2HG being a more potent inhibitor than 
(R)-2HG (Supplementary Fig. 3 and data not shown). (S)-2HG was 


also a micromolar to low millimolar inhibitor of the three mammalian 
HIF prolyl 4-hydroxylases (EGLN1, EGLN2 and EGLN3) under 
standard assay conditions, which included 10 1M 2-OG (Supplemen- 
tary Fig. 4). In contrast, (R)-2HG was not an effective EGLN inhibitor 
(half-maximum inhibitory concentration (IC;9) values >5mM; 
Supplementary Fig. 4). Moreover, we discovered unexpectedly that 
(R)-2HG, but not (S)-2HG, promoted EGLN1 and EGLN2 activity, 
and to a lesser extent EGLN3 activity, at tumour-relevant concentra- 
tions (low mM) in reactions that lacked exogenous 2-OG (Fig. 2a, c, d 
and Supplementary Fig. 5a, b). This was specific because (R)-2HG did 
not promote collagen prolyl 4-hydroxylase activity (Fig. 2b) or 
JMJD2D histone demethylase activity (data not shown). Similar results 
were obtained with EGLN1 purified from either insect cells or 
Escherichia coli and with a heat-inactivated HIF polypeptide substrate 
(Supplementary Fig. 6), making it unlikely that the ability of (R)-2HG 
to promote EGLN activity required a contaminating enzyme. 

To determine how (R)-2HG might promote EGLN activity, we 
monitored the EGLN1 prolyl 4-hydroxylase reaction using liquid 
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Figure 1 | Oncogenic properties of IDH1 R132H. a,b, Anti-HA immunoblot 
(a) and LC-MS analysis (b) of immortalized human astrocytes (passage 4) 
infected with retroviruses encoding HA-tagged versions of the indicated IDH1 
variants. ¢, d, In vitro proliferation under standard culture conditions (c) or in 
soft agar (d) (passage 23). Scale bars, 0.5 mm. e, Number of macroscopic soft 
agar colonies in d (*P < 0.01, **P < 0.005). Error bars show standard deviation 
(s.d.); n = 3. 
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Figure 2 | (R)-2HG can serve as an EGLN cosubstrate. a, b, In vitro prolyl 
4-hydroxylation assays conducted with recombinant EGLN proteins (a) and 
collagen P4H-I (b) in the presence of the indicated amounts of 2-OG or 2HG. 
4-Hyp/Pro, ratio of 4-hydroxyproline to proline. L- [2,3,4,5-7H] proline-labelled 
HIF-1« oxygen-dependent degradation domain (ODDD) (a) and [4c] proline- 
labelled protocollagen (b) were used as substrates. Enzymes were produced in 
insect cells using baculoviruses and affinity purified. Error bars, s.d.; n = 3-4. 
c, d, K,, (in uM) and maximum enzyme velocity (Vina percentage of that 
obtained with 2-OG) values for (R)-2HG for EGLN family members. K,, and 
Vimax Values for 2-OG"' are included for comparison. d.p.m., disintegrations per 
minute. N.D., not defined. e, LC-MS analysis of succinate, 2-OG and (R)-2HG 
from enzymatic reactions with EGLN1, HIF-1«% ODDD polypeptide and either 
5 mM (R)-2HG (red) or 80 UM 2-OG (black) as cofactors. Numbers next to 
each peak indicate elution times (bottom) and peak areas (top). No peaks above 
background were detected in samples in which 2-OG and (R)-2HG were both 
omitted (data not shown). f, Model of (R)-2HG (green) and (S)-2HG (cyan) 
bound to the active site of EGLN1. N-oxalylglycine (magneta) bound in the 
original structure” is shown for comparison. The active-site water molecule, 
which has been shown to be the O-binding site*’, is shown in red and the 
peptide substrate in light blue. Hydrogen bonds are indicated by dash lines. 
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chromatography—mass spectrometry (LC-MS). 2-OG and succinate 
were detected when catalytically active EGLN1 was incubated with 
5mM (R)-2HG and a recombinant HIF-1« polypeptide, suggesting 
that EGLN1 can oxidize (R)-2HG to 2-OG, which is then decarboxylated 
to succinate during the hydroxylation reaction (Fig. 2e and Sup- 
plementary Fig. 5c). In support of this model, '*C-labelled succinate 
was generated in EGLN hydroxylation assays that contained uniformly 
labelled '°C-(R)-2HG (Supplementary Fig. 5d). 

Consistent with the idea that (R)-2HG can substitute for 2-OG as a 
cosubstrate, the addition of increasing amounts of (R)-2HG to EGLN1 
assays containing 10 1M 2-oxo-[1-'*C]glutarate progressively decreased 
the release of '*CO, without decreasing the prolyl hydroxylation of 
HIF-1a, even at concentrations as high as 100 mM (Supplementary 
Fig. 7). Modelling of 2HG bound to the active site of EGLN1 predicts 
that binding of the (S)-enantiomer, but not the (R)-enantiomer, would 
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prevent the subsequent recruitment of oxygen to the active site, 
perhaps accounting for the qualitatively different effects of the two 
enantiomers on EGLN activity (Fig. 2f and Supplementary Fig. 5e). 

To investigate whether these findings were relevant in vivo we 
examined HIF levels in IDH1 mutant cells. In keeping with our bio- 
chemical results, HIF-1% and HIF-2« protein levels were reproducibly 
lower in mid-passage (passage 9-15) human astrocytes producing 
IDH1 R132H relative to control astrocytes (Fig. 3a), due at least partly 
to increased HIF-la and HIF-2« hydroxylation and diminished 
protein stability (Supplementary Figs 8 and 9), and were associated 
with lower levels of the HIF-responsive messenger RNAs encoding 
VEGF, GLUT1 and PDK1 (Supplementary Fig. 10a). Oxygen con- 
sumption and reactive oxygen species production, which can also 
affect HIF levels and the HIF response, were not measurably altered 
in the IDH1 mutant cells (Supplementary Fig. 11). IDH1(R132H)- 
expressing cells were relatively resistant to the 2-OG competitive 
antagonist dimethyloxalylglycine (DMOG), but not to the iron 
chelator deferoxamine (DFO) (Fig. 3b), consistent with (R)-2HG 
acting as a 2-OG agonist in intact cells. In later passages (more than 
passage 20), HIF levels in IDH1(R132H)-expressing immortalized 
human astrocytes began to normalize despite persistent production 
of the exogenous IDH1 protein and (R)-2HG (Supplementary Fig. 12), 
possibly due to adaptive HIF-responsive feedback loops such as those 
involving EGLN3 and microRNA miR-155 (refs 1, 7). These cells, 
however, retained the ability to form colonies in soft agar (data not 
shown) and remained addicted to EGLN activity (see later). 

Similarly, the induction of HIF-1o and HIF-responsive mRNAs by 
hypoxia was diminished in two independent cell lines derived from 
two IDH1 R132H, 1p/19q-codeleted oligodendrogliomas compared to 
a control IDH1 wild-type, 1p/19q-codeleted oligodendroglioma line 
that had been generated in a similar fashion (Fig. 3c and Supplemen- 
tary Fig. 10b). Although these three cell lines have similar growth 
kinetics in vitro (data not shown) they are not isogenic. We therefore 
also tested two HCT116 colorectal cancer cell sublines wherein the 
R132H mutation was introduced into the endogenous IDH1 locus by 
homologous recombination. These sublines also showed a diminished 
HIF response compared to wild-type cells unless EGLN was pharma- 
cologically (DFO) or genetically (short hairpin (sh)RNA) inactivated 
(Fig. 3d, e and Supplementary Fig. 13). 

Finally, we asked whether IDH mutational status influenced HIF 
activity in primary patient astrocytoma samples using the TCGA 
expression data set* and a previously defined HIF-responsive gene 
expression signature’. The HIF signature was diminished in proneural 
tumours—which is the subtype most often associated with IDH muta- 
tions*—relative to other gene-expression-defined subtypes of brain 
cancer (Supplementary Fig. 14a, b) and, notably, was diminished in 
IDH mutant proneural tumours relative to wild-type proneural 
tumours (Fig. 3f and Supplementary Fig. 14c). Similar results were 
obtained with other previously published HIF gene sets and with a 
manually curated data set (Supplementary Fig. 14a, d). 

Downregulating HIF- 10 with three independent shRNAs promoted 
soft agar growth by immortalized human astrocytes after approxi- 
mately 15 passages (Fig. 4a, b and Supplementary Fig. 15), as did 
overproduction of wild-type, but not catalytic-defective, EGLN1 
(Fig. 4c, d and Supplementary Fig. 16). Conversely, downregulation 
of EGLN1 with multiple shRNAs inhibited the proliferation of late 
passage IDH1 R132H cells (Fig. 4e, f and Supplementary Fig. 17) 
unless HIF-1o% was concurrently ablated (Fig. 4g, h and Supplemen- 
tary Fig. 18). 

Collectively, these data suggest that EGLN activation by (R)-2HG, 
and subsequent downregulation of HIF-1«, contributes to the patho- 
genesis of IDH mutant gliomas. Our data do not, however, exclude the 
possibility that (R)-2HG has additional targets, including TET2 and 
JmjC-containing histone demethylases, which contribute to its ability 
to transform cells*®. Indeed, we found that downregulation of TET2 
in human astrocytes also promotes soft agar growth (Supplementary 
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Figure 3 | HIF activity is diminished in IDH mutant cells. a-d, Immunoblot 
analysis of immortalized human astrocytes (passage 10) 

(a, b), oligodendroglioma cells (c) and HCT116 colorectal cells (d) expressing 
the indicated IDH1 variants grown under 21% (N) or 7.5% (H) oxygen for 24h 
before lysis or under 21% oxygen in the presence of DFO (D). Names of cell 
lines are indicated in brackets. In b, cells grown under 21% oxygen were treated 


Fig. 19). Nonetheless, our finding that (R)-2HG and (S)-2HG are 
qualitatively different with respect to EGLN might explain the appar- 
ent selection for (R)-2HG in adult tumours, despite the fact that (S)- 
2HG is a more potent inhibitor of most of the 2-OG-dependent 
enzymes tested so far. It should be noted, however, that (S)-2HG 
has been linked to neurological abnormalities and brain tumours in 
children and young adults with germline (S)-2-HG dehydrogenase 
mutations’®’. The pathogenesis of (R)-2HG-driven tumours linked 
to somatic IDH mutations in adults and (S)-2HG-driven tumours 
linked to inborn errors of metabolism conceivably differ, with the latter 
possibly reflecting the perturbation of one or more neurodevelopmental 
programs during embryogenesis. 

Nor are our data incompatible with the finding that HIF-1 protein 
levels are increased in IDH mutant tumours relative to normal brain”. 
Our data simply suggest that (R)-2HG quantitatively shifts the dose- 
response linking HIF activation to hypoxia, leading to a blunted HIF 
response for a given level of hypoxia. In support of this idea, HIF 
elevation in IDH mutant tumours is usually confined to areas of 
necrosis and presumed severe hypoxia’. 

Although HIF is typically viewed as an oncoprotein it can behave as 
a tumour suppressor in embryonic stem cells'*"®, leukaemic cells’” and 
brain tumour cells'*”’. For example, HIF-10 has been shown to score as 
an oncoprotein when transformed murine astrocytes are grown 
subcutaneously but as a tumour suppressor when such cells are grown 
orthotopically’’. This finding, together with our results, raises the 
possibility that pharmacological inhibition of EGLN activity (and the 
resulting increase in HIF activity) would impair the growth of IDH1 
mutant tumours. A caveat, however, is that IDH mutant tumours tend 
to be relatively indolent”. It is possible that low levels of HIF-1c, 
although promoting some aspects of transformation, simultaneously 
suppress other hallmarks of cancer required for aggressive behaviour 
(such as angiogenesis). 
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with increasing amounts of DMOG or DFO for 16h before lysis. e, Quantitative 
real-time PCR analysis of cells in d under normoxic conditions. Error bars, s.d. 
n = 3. f, Heat map depicting expression of HIF target genes (blue, lower 
expression; red, higher expression) in proneural tumours clustered based on 
IDH status (in the horizontal bar above the matrix, red indicates mutant; 
green, wild type). 


The published 2-OG Michaelis constant (K,,,) values for the EGLN 
family members are below the estimated intracellular 2-OG concen- 
tration®’”, suggesting that 2-OG should not be limiting for EGLN 
activity and that EGLN activity would not be enhanced further by 
(R)-2HG. A caveat is that a considerable amount of intracellular 
2-OG appears to be sequestered in mitochondria and might also be 
bound by other 2-OG-dependent enzymes”. Moreover, these 2-OG 
Kn Values were determined under idealized conditions with purified 
enzymes and substrates in the absence of endogenous inhibitors such 
as reactive oxygen species, nitric oxide and 2-OG competitive mole- 
cules such as succinate and fumarate’. Studies in model organisms 
suggest that many metabolic enzymes are saturated in vivo with a 
mixture of substrate and competitive inhibitors and thus are sensitive 
to changes in substrate concentrations that are far above their nominal 
K,, Values****. We confirmed that succinate and fumarate increase the 
2-OG requirement for the EGLN reaction and that their inhibitory 
activity, like that of DMOG (Fig. 3b) and its active derivative 2-oxalyl 
glycine, is blunted in the presence of (R)-2HG (Supplementary Fig. 20). 

We have not yet formally proven that (R)-2HG is sufficient to 
downregulate HIF in intact cells. It is possible, for example, that addi- 
tional metabolic changes in IDH mutant cells sensitize them to the HIF 
modulatory effects of (R)-2HG”. Another, potentially related, obser- 
vation is that both downregulation of HIF and transformation by 
mutant IDH in immortalized human astrocytes, although highly 
reproducible, was only noted after multiple passages. HIF activates a 
number of genes, including genes that participate in feedback regu- 
lation of the HIF response and genes that modify chromatin structure’. 
It is possible that modulation of the HIF response over time, perhaps in 
conjunction with alterations in other enzymes affected by 2-HG, leads 
to epigenetic changes that ultimately are responsible for transforma- 
tion. If so, it will be important to determine the degree to which such 
changes are reversible. 
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IDH. a, c, Soft agar colony formation by human astrocytes after stable infection 
with lentiviruses encoding the indicated HIF-1% shRNAs or scrambled (scr) 
shRNA (a) or lentiviruses encoding EGLN1, EGLN1 (P317R), or venus 
fluorescent protein (c; RNAi Consortium shRNA clone identifiers are indicated 
in brackets). Scale bars, 0.5 mm. b, d, Number of macroscopic soft agar colonies 
in a and ¢, respectively. *P < 0.05, **P < 0,005. Error bars, s.d. n = 3. 

e, f, Proliferation (e) and immunoblot analysis (f) of human astrocytes 
expressing wild-type or R132H IDH1 and an shRNA against EGLN1 (10578) 
(or scrambled control). g, h, Proliferation of human astrocytes expressing IDH1 
R132H and shRNAs against HIF-1% (0819), EGLN1 (10578), or both. 


METHODS SUMMARY 


NHAs immortalized with E6/E7/hTERT have been described elsewhere” and were 
subsequently infected with IDH retroviruses. IDH1 R132H HCT116 colorectal 
cancer cells were made by homologous recombination as previously described”’. 
Anaplastic oligodendroglioma cell lines were derived from resection material 
obtained at surgeries performed at the Brigham and Women’s Hospital. 
Lentiviral shRNA vectors were from the Broad Institute. Immunoblots were per- 
formed with anti-HIF-1« monoclonal antibody (BD Transduction Laboratories), 
anti-HIF-2« polyclonal antibody (NB 100-122, Novus), anti-Flag monoclonal 
antibody (M2, Sigma-Aldrich), monoclonal anti-HA (HA-11, Covance Research 
Product), anti-EGLN1 monoclonal antibody (D31E11, Cell Signaling), anti- 
GLUT1 polyclonal antibody (NB300-666, Novus Biologicals), monoclonal anti- 
tubulin (B-512, Sigma-Aldrich) or anti-vinculin monoclonal antibody (V9131, 
Sigma-Aldrich). Metabolite levels were determined by LC-MS as previously 
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described’. Enzyme activity assays were performed with affinity-purified human 
EGIN1, 2 and 3, collagen P4H-I and FIH, and murine Tet1 and Tet2 produced in 
insect cells using baculoviral vectors. (R)-2HG and (S)-2HG were from Sigma. (R)- 
2HG was free of contaminating 2-OG as determined by LC-MS under conditions 
that could detect 0.37 1M exogenous 2-OG in 5 mM (R)-2HG (data not shown). 
Human brain tumour mRNA expression data were obtained from The Cancer 
Genome Atlas and processed as described*’*. Cell proliferation assays were 
performed using XTT assays (Cell Proliferation Kit II, Roche) according to the 
manufacturer’s instructions or by direct counting of viable cells using an 
automated cell counter (Invitrogen). For soft agar assays ~8,000 cells were sus- 
pended in a top layer of 0.4% soft agar (SeaPlaque Agarose, BMA products) and 
plated on a bottom layer of 1% soft agar containing DMEM supplemented with 
10% fetal bovine serum. For real-time qPCR analysis total RNAs were first 
extracted with Trizol reagent (Invitrogen). cDNA synthesis and PCR amplification 
were performed with Superscript One-Step RT-PCR (Invitrogen). 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cell lines. The NHA cell line immortalized with E6/E7/hTERT has been described 
elsewhere*® and was maintained in DMEM containing 10% fetal bovine serum 
(FBS) and 1% penicillin/streptomycin in the presence of 10% CO, at 37°C. 
Following retroviral infection, cells were maintained in the presence of hygromycin 
(100 pg ml"). 

Introduction of the IDH1 R132H mutation into HCT116 colon cancer cell lines 
by homologous recombination was as previously described”’. Briefly, targeting 
constructs were designed using the pSEPT rAAV shuttle vector*’. Homology 
arms for the targeting vector were PCR amplified from HCT116 genomic DNA 
using Platinum Taq HiFi polymerase (Invitrogen). The R132H hotspot mutation 
was introduced in the targeting construct using the Quickchange II site-directed 
mutagenesis kit (Stratagene). An infectious rAAV stock harbouring the targeting 
sequence was generated and applied to parental HCT116 cells as previously 
described, and clones were selected in 0.5mgml! geneticin (Invitrogen). 
Excision of the selectable element was achieved with an adenovirus encoding 
Cre recombinase (Vector Biolabs). Genomic DNA and total RNA were isolated 
from cells using QlAmp DNA Blood Kit and RNeasy kit (Qiagen). First strand 
cDNA was synthesized using iScript cDNA Synthesis Kit (BioRad). Successful 
homologous recombination and Cre-mediated excision were verified using 
PCR-based assays and by direct sequencing of genomic DNA and cDNA. 

Anaplastic oligodendroglioma cell lines expressing wild-type IDH1 (BT260) or 

IDH1 R132H (BT237 and BT138) were derived from surgical resection material 
acquired from patients undergoing surgery at the Brigham and Women’s Hospital 
on an Institutional Review Board approved protocol. Briefly, tumour resection 
samples were mechanically dissociated and tumospheres were established and 
propagated in Human NeuroCult NS-A Basal media (StemCell Technologies) 
supplemented with EGF, FGFb and heparin sulphate. All lines were obtained from 
the DF/BWCC Living Tissue Bank and confirmed to be derived from recurrent 
and progressive anaplastic oligodendrogliomas, World Health Organization grade 
IIL, at the time of cell-line isolation, and to have chromosome 1p/19q codeletion. 
The presence of IDH R132H mutation was confirmed by mutant specific antibody 
staining via immunohistochemistry and direct DNA sequencing. 
Vectors. Human IDH1 cDNAs (wild type and R132H) were subcloned as BamHI- 
EcoRI fragments into pBabe-HA-hygro. Aspartic acid residues 273, 275 and 279 
were mutated to asparagines (N) by site-directed mutagenesis and confirmed by 
DNA sequencing. EGLNI, EGLN2 and EGLN3 cDNAs were subcloned into 
pLenti6-Flag expression vector after restriction with XbaI and EcoRI. pLenti6- 
Flag-EGLN1 P317R*’ was prepared by QuikChange Mutagenesis (Stratagene) 
using pLenti6-Flag EGLN1 as a template and the following oligonucleotides: 
5'-GTACGTCATGTTGATAATCGAAATGGAGATGGAAGATCTG-3’ and 5'- 
CACATGTTCCATCTCCATTTCGATTATCAACATGACGTAC-3’. The entire 
EGLN1 coding region was sequenced to verify its authenticity. 

Lentiviral (pLKO.1) HIF-la shRNA vectors (TRCN0000003810, target 
sequence: 5'-GTGATGAAAGAATTACCGAAT-3’; TRCN0000010819, target 
sequence: 5'-TGCTCTTTGTGGTTGGATCTA-3’; TRCN0000003809, target 
sequence: 5'-CCAGTTATGATTGTGAAGTTA-3’), EGLN1 (PHD2) shRNA 
vectors (TRCN0000001042, target sequence: 5'-CITGTTATCTAGCTGAGTT 
CAT-3’; TRCN0000001043, target sequence: 5’-GACGACCTGATACGCCACT 
GT-3'; TRCN00000010578, target sequence: 5’-TGCACGACACCGGGAAGT 
TCA-3’) and TET2 shRNA vectors (TRCN0000122172 (122), target sequence: 
5'-GCGTTTATCCAGAATTAGCAA-3’; TRCN0000144344 (145), target sequence: 
5’-CCTTATAGTCAGACCATGAAA-3’) were obtained from the Broad Institute 
TRC shRNA library. pLKO.1 shRNA with target sequence 5’-GCAAG 
CTGACCCTGAAGTTCAT-3’ was used as negative control shRNA. 
Immunoblot analysis. Cells extracts were prepared with 1X lysis buffer (50 mM 
Tris (pH 8.0), 120 mM NaCl, 0.5% NP-40) supplemented with a protease inhibitor 
cocktail (Complete, Roche Applied Science), resolved on 10% SDS-PAGE gels and 
transferred to nitrocellulose membranes (Bio-Rad). Membranes were blocked in 
TBS with 5% non-fat milk and probed with anti-HIF-1« monoclonal antibody 
(BD Transduction Laboratories), anti- HIF-2« polyclonal antibody (NB 100-122, 
Novus), anti-Flag monoclonal antibody (M2, Sigma-Aldrich), mouse monoclonal 
anti-HA (HA-11, Covance Research Product), anti-EGLN1(PHD2) monoclonal 
antibody (D31E11, Cell Signaling), anti-GLUT1 polyclonal antibody (NB300-666, 
Novus Biologicals), mouse monoclonal anti-tubulin (B-512, Sigma-Aldrich) or 
anti-vinculin monoclonal antibody (Sigma-Aldrich). Bound proteins were 
detected with horseradish-peroxidase-conjugated secondary antibodies (Pierce) 
and Immobilon western chemiluminescent horseradish peroxidase substrate 
(Millipore). 

LC-MS. Metabolite levels in samples were determined by negative mode electro- 
spray LC-MS as previously described’. Briefly, metabolites were extracted from 
exponentially growing cells using 80% aqueous methanol (—80°C) and were 
profiled by LC-MS. (R)-2HG, 2-oxolutarate and succinate were quantified by 
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LC-MS in negative mode using multiple reaction monitoring (MRM) on a 
Quattro micro triple quadruple mass spectrometer (Waters). Samples were diluted 
with equal amounts of 25% acetonitrile and 10 1l aliquots were analysed in trip- 
licate. The HPLC column (Luna NH2, 3 pm, 2.0 X 100mm, Phenomenex) was 
operated isocratically with 130 mM ammonium acetate pH 5.0 in 37% acetonitrile/ 
water. The MRM transitions were: 117 > 73, 117 >93 (succinate), 145 > 57, 
145 > 101 (2-OG) and 147> 85, 147 > 129 ((R)-2HG), 0.2s dwell time for all 
transitions. Calibration curves were set up with Quant Lynx, using standards 
dissolved in reaction buffer. 

Enzyme activity assays. (R)-2HG (H8378) and (S)-2HG (S765015) were from 
Sigma. (R)-2HG was free of contaminating 2-OG as determined by LC-MS under 
conditions that could detect 0.37 [1M exogenous 2-OG in 5 mM (R)-2HG (data not 
shown). Human EGLN1, 2 and 3, collagen P4H-I and FIH, and murine Tet1 and 
Tet2 were produced in insect cells and purified as described earlier**~*’. The 
plasmids to generate the baculoviruses coding for Tet1 and 2 were a gift from Y. 
Zhang (University of North Carolina). IC;9 values for (R)-2HG and (S)-2HG were 
determined based on the hydroxylation-coupled stoichiometric release of '*CO, 
from 2-oxo-[1-'*C]glutarate using synthetic peptides or double stranded oligonu- 
cleotides representing the natural targets of the studied enzymes as substrates. 
These were DLDLEMLAPYIPMDDDFQL (DLD19) for EGLNI, 2 and 3, (PPG); 
for collagen P4H-I, DESGLPQLTSYDCEVNAPIQGSRNLLQGEELLRAL for FIH 
and 5’-CTATACCTCCTCAACTT(mC)GATCACCGTCTCCGGCG-3’ for Tet1 
and 2. The K; values for (S)-2HG for EGLNI1, 2 and 3 were determined by adding 
(S)-2HG in four constant concentrations while varying the concentration of 
2-oxo-[1-4C] glutarate. 

To study whether (R)-2HG, which failed to efficiently inhibit EGLN activity, 
could promote EGLN activity by acting as a cofactor in the place of 2-OG, we 
determined the amount of 4-hydroxy[*H]proline formed by a specific radioche- 
mical procedure** using a L-[2,3,4,5-’H] proline-labelled HIF-1a ODDD as a sub- 
strate. Collagen P4H-I and a [‘*C]-proline-labelled protocollagen® substrate were 
used as controls (detecting 4-hydroxy['*C] proline), and 2-OG (non-labelled) and 
(S)-2HG were assayed for comparison. The K,, values of EGLN1 and EGLN2 were 
determined by adding increasing amounts of (R)-2HG while the concentration of 
the substrate and other cofactors were kept constant. 

Generation of a recombinant HIF-1o@ substrate. The HIF-1~ ODDD substrate, 
spanning residues 356-603 of human HIF-1a, was produced in a BL21(DE3) E. 
coli strain (Novagen) in the presence of L-[2,3,4,5-’H] proline (75Cimmol |, 
PerkinElmer Life Sciences) and affinity purified in a chelating Sepharose column 
charged with Ni?* (ProBond, Invitrogen) exploiting a C-terminal His tag“. 
Concentration of the purified substrate was measured by RotiQuant (Carl Roth 
GmbH) and it was used at K,,, concentrations for the distinct EGLN enzymes”. 
Modelling. EGLN1 active site structure (PDB accession 3HQR”) with 
N-oxalylglycine (NOG, magneta), Mg** and a peptide substrate (light blue), 
was used to model (R)-2HG (green) and (S)-2HG (cyan) into the active site. 
Gene expression profiling and gene-set enrichment analysis. Expression data 
from human brain tumour samples were obtained from The Cancer Genome Atlas 
(http://tcga-data.nci.nih.gov/tcga/tcgaHome2.jsp) and processed as described*”*. 
In short, 200 expression profiles from glioblastoma multiforme and two non- 
neoplastic brain samples were generated using three platforms (Agilent 244K, 
Affymetrix HT-HG-U133A, Affymetrix HuEx), preprocessed using gene centric 
probe sets and three expression values were integrated through factor analysis. 
After consensus clustering using 1,740 variably expressed genes, profiles with a 
negative silhouette metric were identified and removed from the data set, leaving 
expression profiles from 173 glioblastoma tumour samples. IDH1 mutation status 
was established for 116 out of the 173 samples. 

Eleven gene sets representing response to induced hypoxia were reported in 
supplementary table 6 from ref. 41. One gene set was from figure 4C in ref. 9. A 
final set was assembled by one of us (W.G.K.) based on a literature review of 
genes upregulated by HIF in a wide variety of cell types (CA9, EGLN1, EGLN3, 
SLC2A1, BNIP3, ADM, VEGF, PDK1, LOX, PLOD1, CXCR4, P4HA1, 
ANKRD37). Single-sample gene-set enrichment analysis (GSEA) was applied 
as reported previously’. Briefly, genes were ranked by their expression values. 
The empirical cumulative distribution functions (ECDF) of both the genes in the 
signature as well as the remaining genes were calculated. An enrichment score 
was obtained by a sum of the difference between a weighted ECDF of the genes in 
the signature and the ECDF of the remaining genes. This calculation was 
repeated for all signatures and samples. Z-score transformation was applied to 
be able to make scores from different gene sets comparable. A positive score 
indicates gene set activation. A negative value does not indicate inactivation, but 
rather a lack of effect. 

Cell proliferation assays. Cells were plated in 96-well plates (~700 cells per well) 
with a media change every 3 days. The number of viable cells per well at each 
time point was measured using an XTT assay (Cell Proliferation Kit II, Roche) 
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according to the manufacturer’s instructions. Spectrophotometrical absorbance at 
450 nm was measured 5-6 h after adding the XTT labelling reagent/electron coup- 
ling reagent using a microtiter plate reader (Perkin Elmer Life and Analytical 
Science). For direct cell counting, cells were plated in p60 dishes (~50,000 cells 
per dish) with a media change every 3 days. The number of viable cells at each time 
point was measured after trypan blue staining by using an automated cell counter 
(Invitrogen) according to the manufacturer’s instructions. 

Soft agar colony formation assay. Approximately 8,000 cells were suspended in 
a top layer of 0.4% soft agar (SeaPlaque Agarose, BMA products) and plated on 
a bottom layer of 1% soft agar containing complete DMEM supplemented 
with 10% FBS in 6-well plates. After 3-4 weeks, colonies were stained with 0.1% 
iodonitrotetrazolium chloride (Sigma-Aldrich). 

Real-time qPCR analysis. Total RNAs were extracted with Trizol reagent 
(Invitrogen). cDNA synthesis and PCR amplification were performed with 
Superscript One-Step RT-PCR (Invitrogen) with 2 pg total RNA. EGLN3 
cDNA was amplified with sense primer (5’-GCGTCTCCAAGCGACA-3’) and 
antisense primer (5’-GTCTTCAGTGAGGGCAGA-3’). VEGF cDNA was amp- 
lified with sense primer 5'-CGAAACCATGAACTTTCTGC-3’) and antisense 
primer 5'-CCTGAGTGGGCACACACTCC-3’). HIFIA cDNA was amplified 
with sense primer (5’-TATTGCACTGCACAGGCCACATTC-3’) and antisense 
primer (5’-TGATGGGTGAGGAATGGGTTCACA-3’). HIF2A was amplified 
with sense primer (5'-ACAAGCTCCTCTCCTCAGTTTGCT-3’) and antisense 
primer (5’-ACCCTCCAAGGCTTTCAGGTACAA-3’). GLUT1 CAIX cDNA was 
amplified with sense primer (5'-TGGAAGAAATCGCTGAGGAAGGCT-3’) and 
antisense primer (5'-AGCACTCAGCATCACTGTCTGGTT-3’). PDK1 cDNA 
was amplified with sense primer (5'-ATGATGTCATTCCCACAATGGCCC-3’) 
and antisense primer (5'-TGAACATTCTGGCTGGTGACAGGA-3’). As a control, 
B-actin cDNA was amplified with sense primer (5’'-ACCAACTGGGACGACA 


TGGAGAAA-3’) and antisense primer (5’-TAGCACAGCCTGGATAGCAA 
CGTA-3’) 
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Crystal structure of a concentrative nucleoside 
transporter from Vibrio cholerae at 2.4 A 


Zachary Lee Johnson', Cheom-Gil Cheong! & Seok-Yong Lee! 


Nucleosides are required for DNA and RNA synthesis, and the 
nucleoside adenosine has a function in a variety of signalling pro- 
cesses'”. Transport of nucleosides across cell membranes provides 
the major source of nucleosides in many cell types and is also 
responsible for the termination of adenosine signalling. As a result 
of their hydrophilic nature, nucleosides require a specialized class 
of integral membrane proteins, known as nucleoside transporters 
(NTs), for specific transport across cell membranes. In addition to 
nucleosides, NTs are important determinants for the transport of 
nucleoside-derived drugs across cell membranes*°. A wide range 
of nucleoside-derived drugs, including anticancer drugs (such as 
Ara-C and gemcitabine) and antiviral drugs (such as zidovudine 
and ribavirin), have been shown to depend, at least in part, on NTs 
for transport across cell membranes**’. Concentrative nucleoside 
transporters, members of the solute carrier transporter superfamily 
SLC28, use an ion gradient in the active transport of both nucleosides 
and nucleoside-derived drugs against their chemical gradients. The 
structural basis for selective ion-coupled nucleoside transport by 
concentrative nucleoside transporters is unknown. Here we present 
the crystal structure of a concentrative nucleoside transporter from 
Vibrio cholerae in complex with uridine at 2.4 A. Our functional 
data show that, like its human orthologues, the transporter uses a 
sodium-ion gradient for nucleoside transport. The structure reveals 
the overall architecture of this class of transporter, unravels the 
molecular determinants for nucleoside and sodium binding, 
and provides a framework for understanding the mechanism of 
nucleoside and nucleoside drug transport across cell membranes. 

Humans have three isoforms of concentrative nucleoside trans- 
porter (hCNT), and the substrate specificities and tissue distributions 
of these isoforms are different'*'°. Knowledge of the mechanism of 
these transporters would help us not only to understand physiological 
processes associated with nucleosides but also to provide a framework 
for future drug design to improve nucleoside drug delivery. The major 
barrier to achieving a mechanistic understanding is the lack of atomic 
structures that reveal the origins of nucleoside specificity and the 
principles of function of CNTs. For structure determination and func- 
tional studies we chose a CNT homologue from Vibrio cholerae 
(vcCNT) because of its high sequence homology to hCNTs (39% 
identical to hCNT3; Supplementary Fig. 1) and optimal biochemical 
stability. 

To test whether vcCNT can transport nucleosides, and if so what ion 
gradient vcCNT uses, we performed a radioactive nucleoside uptake 
assay using recombinant vcCNT-reconstituted liposomes. Studies have 
shown that the human CNTs use Na” ions and the Escherichia coli CNT 
uses H* for nucleoside transport'”’. Transport activity was measured 
by monitoring the uptake of [5,6-*H]uridine in both the presence and 
the absence of a Na” gradient (Fig. 1a). We chose uridine because all of 
the CNTs that have been characterized so far transport uridine'*’”"”. 
When a Na’ gradient was present, vesicles containing vcCNT accumu- 
lated radioactive uridine significantly more than the control empty 
vesicles. When a Na‘ gradient was not present (Na~ was replaced with 
choline in the solution), uridine uptake was significantly decreased. 
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Figure 1 | vcCNT is a Na‘ -coupled nucleoside transporter with a trimeric 
architecture. a, Time course of the uptake of 2.4 uM [5,6-*H]uridine into 
vesicles containing vcCNT in the presence ofa Na’ gradient (squares, NaCl), in 
the absence of a Na~ gradient (triangles, choline chloride), and the control 
empty vesicles in the presence of Na* (diamonds, empty). Results are 
means = s.d. (n = 3). b, Cartoon representation of the vcCNT trimer viewed 
from the cytoplasm. Individual protomers are coloured blue, red and green. 
c, Cartoon representation for the vcCNT trimer viewed parallel to the 
membrane. The putative membrane bilayer is indicated in c with grey boxes. 
The dimensions of the putative membrane bilayer, extracellular region, and 
intracellular region are shown. 
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Further characterization showed that uridine uptake was electrogenic 
and independent of pH gradients (Supplementary Fig. 2). These data 
clearly show that vcCNT, like human CNTs, uses a Na” gradient to 
transport nucleosides. 

We crystallized and solved the structure of vcCNT at 24A 
(Supplementary Table 1). Experimental phases to 3.5 A were obtained 
by single anomalous dispersion (SAD) from a platinum-soaked crystal 
(see Methods). The final model contains a single protomer in the 
asymmetric unit and is of good quality with a free R factor of 22.8% 
(Supplementary Table 1). 

vcCNT crystallizes as a homotrimer that is shaped like an inverted 
triangular basin with its mouth facing the intracellular side and a knob- 
like structure facing the extracellular side (Fig. 1b, c). The three-fold 
axis coincides with the crystallographic six-fold axis and is perpendic- 
ular to the membrane. When viewed from the intracellular side, each 
side of the triangle formed by the trimer is about 92 A, and when 
viewed parallel to the membrane the trimer is about 57 A in height 
(Fig. 1). The membrane-embedded region lies roughly in the middle of 
the transporter, judging from the positions of three amphipathic helices 
on each protomer. On the basis of the predicted location of the 
membrane bilayer, the mouth of the basin penetrates into the mem- 
brane plane (Fig. 1c and Supplementary Fig. 3). The presence of several 
polar amino acids on the basin surface probably allows bulk aqueous 
solution to reach deep into the membrane bilayer and access this 
surface of the transporter. 

The structure reveals that each protomer contains eight trans- 
membrane helices (TM1-TM8), two re-entrant helix—turn-helix 
hairpins (HP1 and HP2) with opposite orientations in the membrane, 
and three interfacial helices (IH1-IH3) that run parallel to the 
membrane (Fig. 2a). The orientation of the structure relative to the 


Figure 2 | Topology and fold of the vcCNT protomer. a, Schematic 
representation of vcCNT topology. The group of helices under the pink 
triangular background is related to the group of helices under the cyan 
triangular background by two-fold pseudo-symmetry, with the symmetry axis 
parallel to the membrane. b, Cartoon representation of the vcCNT protomer 
fold. Only helices comprising the scaffold domain and the group of helices 
under the pink triangular background are labelled. c, As in b but rotated 
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membrane is consistent with previous accessibility studies” and con- 


forms to the positive-inside rule (Supplementary Fig. 4). hCNTs are 
predicted to contain three more amino-terminal transmembrane helices 
than prokaryotic CNTs, suggesting an 11-TM topology for hCNTs”. 

Each protomer can be grouped into two subdomains on the basis of 
their locations (at the outer and inner regions) relative to the centre of 
the protomer. TM1, TM2, IH1, EH (a short stretch of extracellular 
helices), TM3 and TM6 are located at the outer part of the transporter, 
and they seem to be important for maintaining the overall architecture 
of the transporter. These helices in the outer region form a scaffold for 
the transporter (hereafter termed the scaffold domain) (Fig. 2). 
Trimerization contacts are mediated by part of the scaffold domain: 
TH1, EH, TM3 and TM6 (Fig. 2b). IH1 is an amphipathic helix 40 A 
long that is most probably situated at the water/membrane interface, 
and TM6 is about 60 A long and tilted almost 60° with respect to the 
membrane normal. EH protrudes into the extracellular solution and is 
about 16 A in length. Because IH1 is involved in trimerization, is con- 
strained to be at the membrane/water interface and interacts with 
many TMs (TM2-TMS), it is most probably important in building 
and maintaining the overall structure of vcCNT. 

Surrounded by the scaffold domain, many conserved amino acids 
implicated in nucleoside transport are localized at the inner domain 
(hereafter termed the transport domain). The transport domain is 
composed of two structural groups that are related by an internal 
two-fold pseudo-symmetry. The first group is composed of IH2, 
HP1, TM4a/b and TMS (pink background in Fig. 2a), and the second 
group is composed of IH3, HP2, TM7a/b and TM8 (cyan background 
in Fig. 2a). These two groups, separated by TM6 in sequence, can be 
superimposed with a root mean squared deviation of 3.3 A, with the 
two-fold symmetry operator running though the middle of TM6 and 


through 120°. Only TM1, TM6 and helices comprising the group of helices 
under the cyan triangular background are labelled. d, As in c but rotated 
through 120°. The scaffold domain has been removed to show the pseudo-two- 
fold relationship between the two groups of helices, coloured as pink and cyan 
triangular backgrounds. The two-fold symmetry axis runs through the 
nucleoside at the centre of the transporter. 
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parallel to the membrane (Fig. 2d). There is no significant amino-acid 
sequence homology between these two groups (roughly 10% sequence 
identity). This two-fold symmetry relationship positions the tips of 
HP1, HP2 and unwound regions of TM4 and TM7 at the centre of 
the transport domain, which is located slightly below the middle of the 
membrane plane (Fig. 2d). Sequence alignment of hCNTs and vcCNT 
reveals high sequence conservation around HP1, HP2 and the two 
unwound helices (TM4 and TM7), indicating the functional import- 
ance of this region (Supplementary Fig. 1). 

To the best of our knowledge, the overall fold of vcCNT is novel, 
although local structural elements such as helical hairpins and 
unwound helices have been observed previously**”*. 

The crystal structure suggests that vcCNT adopts a trimeric con- 
figuration. To test whether the stoichiometry of vcCNT is trimeric and 
further validate the physiological relevance of our crystal structure, we 
performed structure-guided disulphide bridge crosslinking experi- 
ments (Supplementary Fig. 5). Our cysteine mutants readily form 
disulphide-crosslinked trimers in both detergent micelles and cell 
membranes under oxidizing conditions; therefore our crystal structure 
reflects a physiologically relevant oligomerization state. Given the 
sequence conservation of most of the amino acids involved in trimer- 
ization, we propose that the stoichiometry of both eukaryotic and 
prokaryotic CNT family members is trimeric (Supplementary Fig. 1). 

The structure contains three deep clefts (one per subunit) at the 
intracellular side facing the centre of the trimer (Fig. 3a). A simulated- 
annealing OMIT map clearly shows that the electron density in the 
cleft is that of uridine (Fig. 3b). The bound uridine in the cleft faces the 
intracellular basin of the transporter; however, it is not free to be 


Figure 3 | Nucleoside-binding site and Na‘ -binding site. a, Cut-away 
surface representation of vcCNT viewed parallel to the membrane. The putative 
membrane bilayer is shown by horizontal lines. Uridine is shown as spheres. 
b, The vcCNT protomer viewed from the centre of the trimer. The scaffold 
domain is shown in ribbon representation; the transport domain is shown in 
cartoon representation. The blue mesh, covered by TM6 (green), is an F, — F. 
simulated annealing OMIT map, contoured at 40, showing density for uridine. 
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released into the intracellular solution because TM6 and TM7b partly 
cover the binding site (Fig. 3a, b and Supplementary Fig. 6). 

The nucleoside-binding site is located at the centre of the internal 
two-fold symmetry and lined by the tips of HP1 and HP2 and the 
unwound regions of TM4 and TM7 (Fig. 3b,c). Inspection of the 
interactions between uridine and the binding site shows that many 
polar or charged amino acids from HP1, HP2, TM4 and TM7 interact 
with the uracil base and ribose. HP1 and TM4b are responsible for 
interacting with the uracil base. The side chains of Gln 154, Thr 155 
and Glu 156 from HP1 interact with the uracil base either directly 
(Gln 154) or indirectly through a water molecule (Thr155 and 
Glu 156; Fig. 3c). Val 188 from TM4b interacts with the uracil base 
by means of van der Waals interactions. The involvement of these 
amino-acid residues in the interactions with the nucleoside is consist- 
ent with previous mutational studies’: the residue corresponding to 
Gln 154, together with that corresponding to Val 188, is important for 
nucleoside specificity of hCNTs; Glu 156 is critical for Na * -nucleoside 
coupled transport. 

HP2 and TM7 are responsible for the interactions with ribose. The 
side chains of Glu332 (HP2), Asn 368 (TM7) and Ser 371 (TM7) 
interact with the ribose either directly (Glu 332, Asn 368 and Ser 371) 
or indirectly through a water molecule (Asn 368). Mutation of the 
residue corresponding to Glu 332 in hCNTs has been shown to have 
significant functional effects on both nucleoside binding and the rate of 
transport’’”®. 

Because vcCNT is a Na*-coupled transporter, it must contain at 
least one Na‘ -binding site. Initial hints regarding the location of the 
Na‘ -binding site came from an F, — F, map that shows a strong peak 


The inset is a zoomed-in image of the simulated annealing OMIT electron 
density. c, The nucleoside-binding site, showing HP1 (red), TM4b (orange), HP2 
(blue) and TM7b (cyan). Hydrogen bonds are shown as dashed lines. d, The 
Na‘ -binding site is located between HP1 (red) and the unwound region of TM4 
(orange). The blue mesh is an F, — F. simulated annealing OMIT map, 
contoured at 6a, showing density for Na*. Coordination of the Na“ ion is shown 
as dashed lines. e, The Na‘ -binding site is near the nucleoside-binding site. 
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HP 1 


Inward facing Outward facing 


Figure 4 | Hypothetical mechanism of nucleoside transport. Only a single 
protomer is shown for simplicity, viewed from the centre of the trimer (same 
orientation as in Fig. 3b). On the left, the transporter adopts an inward-facing 
conformation. On the right, the transporter adopts an outward-facing 
conformation. Uridine is bound between the tips of HP1 and HP2, and TM6 
(grey), serving as a hydrophobic barrier, partly covers uridine. The blue dotted 
lines demarcate the mobile part of the transport domain, and the grey region 
denotes the scaffold domain. Green circles represent sodium ions. A rigid-body 
motion of the nucleoside-binding site across TM6 can expose the binding site to 
the extracellular side. 


(about 80) between the tip of HP1 and the unwound region of TM4. 
Further inspection reveals that this site is octahedrally coordinated by 
three backbone carbonyl groups, two side-chain hydroxyl groups anda 
water molecule (Fig. 3d) with the typical distances (about 2.4 A) for 
Na™ coordination by protein’. Because it is difficult to distinguish 
Na‘ from H,0 on the basis of electron density alone, we performed 
the valence test that was used to identify the Na” sites in the crystal 
structure of LeuT*”’. The result (v = 1.02) is consistent with Na‘ as 
the bound ion in this site. Therefore we suggest that vcCNT has at least 
one Na’ -binding site at this position between HP1 and the unwound 
region of TM4. 

The Na -binding site is near the nucleoside-binding site, but the Na* 
does not interact directly with the nucleoside (Fig. 3e). What is the 
functional role of Na* binding at this site? Because key amino-acid 
residues for the interactions with the nucleoside base are localized on 
HP1 and TM4b, we propose that the role of Na” -bindingis to bring HP1 
close to TM4 to complete the formation of the nucleoside-binding site. 

In the field of Na*-coupled secondary transporters, the prevailing 
model for ion-coupled transport is described by the alternating-access 
mechanism*”. If we assume that the alternating-access model is used, 
our crystal structure probably represents an inward-facing occluded 
conformation (Fig. 3a and Supplementary Fig. 6). On the basis of our 
crystal structure, previous accessibility studies of hCNTs, and studies 
of the Na*-coupled aspartate transporter Gltp,, which shares certain 
structural features with vcCNT (Supplementary Fig. 7), we propose a 
hypothetical model to explain Na“ -coupled nucleoside transport by 
vcCNT (Fig. 4). A rigid-body movement of the transport domain 
across TM6 while the scaffold domain is held in place could permit 
the transition from an inward-facing to an outward-facing conforma- 
tion. Our reasoning in proposing this mechanism is provided in 
Supplementary Information. 

The crystal structure provides an important step towards under- 
standing the mechanism of nucleoside transport by the CNT family. 
However, many important issues remain to be elucidated, including 
the structural basis for nucleoside and nucleoside-drug specificity and 
the conformational changes associated with each step along the trans- 
port cycle. 


METHODS SUMMARY 


vcCNT was expressed in E. coli C41 cells. Cells were lysed and protein was solubilized 
with dodecyl maltoside. Protein was purified by Co’*-affinity chromatography 
followed by gel-filtration chromatography in the presence of decyl maltoside and 
uridine. 
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vcCNT was crystallized in the presence of 100 mM CaCl,, 37-42% PEG400 and 
100 mM Tris-HCl pH 9.0 or 100 mM glycine pH 9.5. Platinum derivatives were 
prepared by soaking the crystals in K,Pt(CNS)¢. A partial poly-Ala model was 
built with phases to 5.0 A from single isomorphous replacement with anomalous 
scattering (SIRAS). A complete model was built by combining phases from single 
anomalous dispersion using a new platinum derivative with molecular replacement 
phases from the partial model (MR-SAD) to 3.5 A. Structure refinement was then 
performed against the 2.4-A native data. The final model is of good quality with 
good Ramachandran statistics (98% favoured and 2% allowed). The model 
contains residues 2-416 (residues 230-240 missing), a uridine, 5 decylmaltosides 
(1 full and 4 partial), a Na* ion and 43 bound waters. Disordered residues Lys 226 
and Glu 345 were modelled as Ala. 

For the flux assay, protein was reconstituted into lipid vesicles and then vesicles 
were diluted into buffer containing either choline chloride or NaCl. Flux was 
initiated by the addition of 2.4 1M [5,6-*H]uridine and 1 1M valinomycin. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Expression and purification. The gene encoding vcCNT was cloned from Vibrio 
cholerae genomic DNA into a modified pET26 vector that contains a pelB leader 
sequence and a PreScission Protease cleavable His, -maltose-binding protein 
fusion. The original vector was a gift from R. Dutzler (University of Zurich) and 
was further modified. The vector containing vcCNT was expressed in E. coli C41 
(DE3) cells. Cells were lysed with a homogenizer (Avestin) and protein was 
solubilized with 30mM dodecyl maltoside (DDM) for 2h at 4°C. Solubilized 
lysates were spun down to remove the insoluble fraction, and supernatants were 
applied to Talon Co** affinity resin. After binding, protein was eluted with 
imidazole and digested overnight with PreScission Protease. The digestion 
mixture was concentrated and applied to a Superdex 200 size-exclusion column 
in the presence of 5mM decyl maltoside (DM), 1 mM uridine, 150 mM NaCl, 
20mM Tris-HCl pH 8.0, 2mM dithiothreitol. Peak fractions corresponding to 
vcCNT were collected for further experiments. 

Crystallization. Initial crystallization conditions were obtained at the high- 
throughput crystallization-screening laboratory at the Hauptman-Woodward 
Institute*!. Protein was concentrated to about 10mgml~! and mixed 1:1 with 
crystallization solution containing 100mM CaCl, 37-42% PEG400 and 
100 mM buffer. Crystals grew over a wide pH range (5.6-9.5), but data for struc- 
ture solution were collected on crystals grown at pH 9.0 (100 mM Tris-HCl) and 
pH9.5 (100 mM glycine). Crystals were grown by using the microbatch-under-oil 
method. After 10-14 days, crystals were harvested, transferred to cryo solution 
containing 32.5% PEG400, and flash-frozen in liquid nitrogen. Platinum deriva- 
tives were prepared by soaking for 2-4h in 2.5 mM K,Pt(CNS), and then trans- 
ferring them to cryo solution and flash-freezing. 

Structure determination. The data were collected on beamlines 22ID, 22BM and 
24ID-C at the Advanced Photon Source. The data were processed with HKL2000 
(ref. 32). Crystals of veCNT diffract to 2.4 A Bragg spacings and belong to the space 
group P63. Extensive screening of crystals was performed because roughly 70% of 
crystals are merohedrally twinned, with the twinning operator perpendicular to 
the crystallographic six-fold axis, leading to the apparent space group P6322. We 
found that SeMet-substituted crystals were almost always twinned with significant 
twinning fractions (20-45%), which necessitated heavy-atom-soaked crystals as 
the choice for a de novo phasing method for the vcCNT structure. Initial phases to 
5.0 A were obtained by single isomorphous replacement with anomalous scatter- 
ing (SIRAS) from a platinum-soaked crystal. Platinum sites were found by 
SHELXD*; phasing was calculated to 5.0 A Bragg spacings by using SOLVE™ with 
the figure of merit 0.34, and the density was modified by solvent flattening with 
RESOLVE”. A partial model was built by manually placing idealized poly-Ala 
helices into the solvent-flattened electron density map by using COOT™. After 
further extensive screening of platinum derivatives, we found a derivative that 
diffracted to 3.1 A with significant anomalous signal but was non-isomorphous 
with any of our native data. Platinum sites were found from an anomalous differ- 
ence Fourier map by using the partial model phases. Combined phases of single 
anomalous dispersion from the platinum derivative and molecular replacement 


from the partial model (MR-SAD) were calculated at 4.0 Awith the figure of merit 
0.35 and extended to 3.5 A by solvent flattening with PHASER” and RESOLVE* 
with the use of the PHENIX interface™. After iterative cycles of manual adjustment 
of poly-Ala helices and calculation of combined MR-SAD phases, the electron 
density map was of excellent quality and allowed us to place side chains into the 
partial model. We also collected native data to 2.4 A at a long wavelength (1.6 A) 
and identified sulphur sites by using an anomalous difference Fourier map, which 
helped guide the model building. After about 70% of manual model building was 
complete, molecular replacement was performed with the partial model against 
the 2.4 A native data for further model building and refinement. Structure refine- 
ment was performed with PHENIX™. An anomalous difference Fourier map with 
the native data to 2.4 A collected at a long wavelength (1.6 A) identified six sulphur 
sites and helped guide the model building. The final model is of good quality with 
Rwork/Riree = 19.6/22.8% and good Ramachandran statistics (98% favoured and 
2% allowed), and contains residues 2-416 (residues 230-240 missing), a uridine, 5 
DMs (1 full and 4 partial), a Na‘ ion and 43 bound waters. 

Vesicle reconstitution and flux assay. Protein was reconstituted into lipid vesicles 
containing 10 mg ml ' of 1-Palmitoyl-2-oleoyl-sn-glycero-3-phosphoethanolamine 
(POPE) and 1-Palmitoyl-2-oleoyl-sn-glycero-3-phosphoglycerol (POPG) (3:1 ratio 
of POPE:POPG) at a mass ratio of 1:500 protein:lipid, as described previously”. 
Vesicles were reconstituted in the presence of 200mM KCl, 20mM HEPES 
pH7.4, 100mM choline chloride. Reconstituted vesicles were then flash-frozen 
and thawed three times, then extruded through a 1.0-um filter with the use of the 
Avanti Mini-Extruder. 

For the flux assay, vesicles were diluted 1:20 into buffer containing 200 mM KCl, 
20 mM HEPES pH 7.4, and either 100 mM choline chloride or 100 mM NaCl. Flux 
was initiated by the addition of 2.4 1M [5,6-*H]uridine and 1 1M valinomycin. All 
experiments were performed in triplicate at 30 °C. Vesicles were harvested on GF/B 
glass microfibre filters (Whatman) and counted by scintillation on the following day. 


31. Koszelak-Rosenblum, M. et al. Determination and application of empirically 
derived detergent phase boundaries to effectively crystallize membrane proteins. 
Protein Sci. 18, 1828-1839 (2009). 

32. Otwinowski, Z. & Minor, W. Processing of X-ray diffraction data collected in 
oscillation mode. Methods Enzymol. 276, 307-326 (1997). 

33. Sheldrick, G. M.A short history of SHELX. Acta Crystallogr. A 64, 112-122 (2008). 

34. Terwilliger, T. C. & Berendzen, J. Automated MAD and MIR structure solution. Acta 
Crystallogr. D Biol. Crystallogr. 55, 849-861 (1999). 

35. Terwilliger, T. C. Maximum-likelihood density modification. Acta Crystallogr. D Biol. 
Crystallogr. 56, 965-972 (2000). 

36. Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta 
Crystallogr. D Biol. Crystallogr. 60, 2126-2132 (2004). 

37. McCoy, A. J. et al. Phaser crystallographic software. J. Appl. Cryst 40, 658-674 
(2007). 

38. Adams, P. D. et a/. PHENIX: a comprehensive Python-based system for 
macromolecular structure solution. Acta Crystallogr. D Biol. Crystallogr. 66, 
213-221 (2010). 

39. Lee, S. Y., Letts, J. A. & MacKinnon, R. Functional reconstitution of purified human 
Hv1 H* channels. J. Mol. Biol. 387, 1055-1060 (2009). 


©2012 Macmillan Publishers Limited. All rights reserved 


[LER 


doi:10.1038/nature10881 


Identification and characterization of a bacterial 
hydrosulphide ion channel 


Bryan K. Czyzewski!? & Da-Neng Wang'* 


The hydrosulphide ion (HS ) and its undissociated form, hydrogen 
sulphide (HS), which are believed to have been critical to the origin 
of life on Earth’, remain important in physiology and cellular sig- 
nalling’. As a major metabolite in anaerobic bacterial growth, hydro- 
gen sulphide is a product of both assimilatory and dissimilatory 
sulphate reduction” *. These pathways can reduce various oxidized 
sulphur compounds including sulphate, sulphite and thiosulphate. 
The dissimilatory sulphate reduction pathway uses this molecule as 
the terminal electron acceptor for anaerobic respiration, in which 
process it produces excess amounts of H,S (ref. 4). The reduction 
of sulphite is a key intermediate step in all sulphate reduction 
pathways. In Clostridium and Salmonella, an inducible sulphite 
reductase is directly linked to the regeneration of NAD*, which 
has been suggested to have a role in energy production and growth, 
as well as in the detoxification of sulphite’. Above a certain con- 
centration threshold, both H,S and HS inhibit cell growth by 
binding the metal centres of enzymes and cytochrome oxidase’, 
necessitating a release mechanism for the export of this toxic 
metabolite from the cell’. Here we report the identification of a 
hydrosulphide ion channel in the pathogen Clostridium difficile 
through a combination of genetic, biochemical and functional 
approaches. The HS” channel is a member of the formate/nitrite 
transport family, in which about 50 hydrosulphide ion channels 
form a third subfamily alongside those for formate’®" (FocA) and 
for nitrite’? (NirC). The hydrosulphide ion channel is permeable to 
formate and nitrite as well as to HS” ions. Such polyspecificity can 
be explained by the conserved ion selectivity filter observed in the 
channel’s crystal structure. The channel has a low open probability 
and is tightly regulated, to avoid decoupling of the membrane 
proton gradient. 

Because HS is a weak acid with a pK, of 6.8 and a second pK, of 19, in 
a biological system more than 80% of it will exist in its ionized form as 
HS (refs 2, 13). To account for membrane permeation of intracellularly 
produced HS (refs 6, 9), a channel for HS” or H,S has been suggested*’, 
with the water channel aquaporins as possible candidates*. Whereas 
studies in planar lipid bilayers indicate that the aquaporin from 
Archaeoglobus fulgidus is not permeable to H,S and that the lipid bilayer 
provides little resistance to HS permeation’, in the vent worm Riftia 
pachyptila it has been shown that HS” ions, but not H,S, are selectively 
transported through its outer epithelium into its vasculature’. 

Despite the high permeability of H2S across the lipid bilayer, we 
argue that its weak acidity necessitates a release mechanism for its 
ionized form, HS” (Fig. 1a). As a weak acid with a pK, of 6.8, H2S 
can diffuse readily across the lipid bilayer. Therefore, the distribution 
of H,S and HS on the two sides of the cell membrane is directly 
proportional to the pH differential as described by the Henderson- 
Hasselbalch equation”. Because cellular pH is kept at neutral levels, 
and extracellular pH is typically one to two units more acidic, at 
equilibrium the HS” concentration will be 10-100-fold greater on 
the inside of the cell than on the outside in the absence of a release 


mechanism for the HS anions. Therefore, given its significant 
toxicity, it would be beneficial to directly expel this ion across the 
membrane by the quickest mechanism available: an ion channel. 
Recently, a novel family of anion channels for short-chain acids has 
been functionally characterized''"*"°. Members of the formate/nitrite 
transport (FNT) family transport various anions during anaerobic 
bacterial growth'®'*””. Judging from sequence homology, however, 
only about half of its 2,000 identified members clearly belong to either 
the FocA or the NirC subfamily (Figs 1b, c and Supplementary Figs 1 
and 2). Notably, for both FocA and NirC, each channel gene is 
genetically linked to its reductase partner: focA to the formate reductase 
pfiB’° and nirC to the nitrate reductase nirBD’’ (Fig. 1b). Our phylo- 
genetic analysis of bacterial genes suggests the existence of additional 
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Figure 1 | Genetic analyses and functional characterization of FNT3 and the 
asrABC operon. About 50 FNT genes from Clostridium are linked to the 
sulphite reductase gene asrABC, and they form a third subfamily (FNT3) in the 
ENT family. a, Model of the intracellular anion concentrative effect for the weak 
acid HS. b, Genomic organization of three types of FNT gene and their 
metabolically related reductase genes. Representatives are shown for focA, nirC 
and FNT3 with their respective linked operons. c, Phylogenetic tree of 474 
bacterial and archaeal members of the FNT family. Branches are coloured on 
the basis of genetic linkage to metabolic enzymes: genes linked to pyruvate 
formate lyase (pflBA) or formate dehydrogenase (fdhAB; fdhA and fdhB also 
known as selA and selD, respectively) are coloured pink, genes linked to nitrite 
reductase (nirBD) are coloured blue and genes linked to sulphite reductase 
(asrABC) are coloured green. The FocA protein in archaea is encoded by fdhC 
(selC). The grey areas represent FNT family members with no assigned function 
based on genetic linkage. d, Bismuth sulphite agar plate assay: vector control 
(top); asrA, asrB and asrC (left); asrA, asrB, asrC and FNT3 (right). 
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channel subfamilies in the FNT family (Fig. 1c and Supplementary 
Fig. 1). One such subfamily is a group of ~50 homologous genes, 
temporarily termed FNT3, found in various species of Clostridium that 
are major human pathogens grown under strictly anaerobic conditions. 
As it is linked to a reductase gene, the asrABC operon’® (Fig. 1b), we 
proposed that FNT3 genes might encode a channel for HS or related 
ions. AsrABC reduces sulphite (SO; 7) to sulphide (S-?), which is 
rapidly converted into H2S and HS” under physiological conditions. 

Salmonella asrABC is the only such operon that has been characterized 
thus far. Its overexpression in Escherichia coli, which lacks this bio- 
chemical pathway, was previously shown to reduce SO; * to H,S 
(ref. 19). To verify first whether the homologous asrABC operon from 
C. difficile also encodes an SO3 7 reductase, we tested the growth of 
E. coli transformed with the C. difficile asrABC genes, plated on bismuth 
sulphite agar’’*°. When C. difficile asrABC was induced, E. coli was able 
to overcome the expected growth inhibition by reducing SO; * to HS 
(Fig. 1d and Supplementary Fig. 3). Darkening along the edges of the 
colonies, formed by the precipitation of bismuth and iron sulphide, 
further indicated that H,S gas was being produced. This showed that 
the asrABC genes from C. difficile do indeed encode for an SO3 7 
reductase, which in turn supports the notion that the FNT3 gene 
(‘FNT3’) in the asrABC operon may encode a channel for SO3 7 or 
for its reduced product, HS. 

To determine whether the FNT3 protein from C. difficile functions 
as achannel for HS or SO; *, we then carried out whole-cell trans- 
port, ion-protein binding in solution and transport-inhibition assays 
in reconstituted proteoliposomes. First we transformed sulphide- 
producing Salmonella typhimurium, which lacks an HS” channel, 
with the C. difficile FNT3. In SO, *-supplemented minimal medium, 
the FNT3-expressing S. typhimurium released a high concentration of 
HS ions into the culture medium (Fig. 2a and Supplementary Fig. 4). 
This increase in extracellular HS” was specific to HS produced by the 
endogenous cytoplasmic AsrABC; when the other sulphide-producing 
reductase in the cell, the periplasmic thiosulphate reductase, was 
engaged with its substrate thiosulphate, little effect on the extracellular 
concentration of HS” was observed. This indicates that the measured 
HS” was generated by SO3 * reduction by the cytoplasmic AsrABC 
and then exported. Such an FNT3-linked increase in HS production 
in S. typhimurium can be attributed to either increased import of 
SO; * or increased export of HS” from the cell. It follows that an 
ENT3 protein could be an ion channel for SO; * or HS. 

An ion channel protein is often stabilized by its permeating ions 
through their direct interaction with its selectivity filter’. Therefore, 
we tested whether purified FNT3 protein interacts with the monovalent 
HS~ or the divalent SO; *. Using size-exclusion chromatography of 
purified protein samples incubated at different temperatures, we found 
that at pH 8.0 the presence of HS" ions was able to increase the nominal 
melting temperature of the FNT3 protein by 8 °C, whereas SO; * had 
little effect (Fig. 2b and Supplementary Table 1). This result is consistent 
with the hypothesis that FNT3 protein functions asan HS” channel but 
is probably impermeable to SO; ~. Notably, three other monovalent 
anions, formate, nitrite and chloride, also increased the melting tem- 
perature of the FNT3 protein, but to a lesser degree (3-4 °C). 

We went on to identify the permeable ions of the FNT3 protein 
in vitro. As channels in the FNT family are expected to have low 
conductance rates'', we measured the transport activity of FNT3 
protein using a concentrative uptake assay’'**. Given that HS” ions 
at concentrations needed to set up a sufficient electrochemical gradient 
(~150 mM) are severely disruptive to the membrane, we chose to 
measure the permeability of FNT3 protein using another interacting 
ion, formate (Fig. 2b). Proteoliposomes reconstituted at pH 8.0 from 
purified C. difficile FNT3 protein were found to be permeable to formate, 
although at a lower rate than FocA from Vibrio cholerae (Fig. 2c). This 
permeability to formate by FNT3 protein allowed us to search for other 
ions that directly compete with formate for transport. Indeed, when 
HS" or nitrite was present in the buffer outside the proteoliposomes, 
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Figure 2 | Binding and transport activity of HSC in reconstituted 
proteoliposomes. a, Measurements of sulphide concentrations in the media of 
Salmonella transformed with vector control or vector encoding HSC. Minimal 
media were supplemented with either sulphite or thiosulphate to induce the 
production of hydrogen sulphide from either periplasmic thiosulphate 
reductase or cytoplasmic sulphite reductase. OD, optical density. b, Binding of 
detergent-solubilized and purified HSC to various anions was determined by 
using thermostability coupled size-exclusion chromatography. Peak heights of 
recovered samples are plotted against temperature and fitted to a Boltzmann 
sigmoidal model to determine nominal melting temperatures. c, Radiolabelled 
formate uptake in proteoliposomes reconstituted with purified HSC at pH 8.0 
was monitored in a concentrative uptake assay and compared with FocA 
activity or vesicle controls. c.p.m., counts per minute. d, Inhibition of 
radiolabelled concentrative uptake of formate by the addition of various anions 
at increasing concentrations. The graph shows the amount of radiolabelled 
formate measured at the 10-min time point for each concentration of anion 
tested. The concentrations of the competing anions were 0, 0.15, 0.6, 3 and 
15 mM (left to right). Error bars, s.e.m. (N = 3). 


the uptake of formate was drastically inhibited (Fig. 2d). In the physio- 
logical sulphide concentration range (1-2 mM), ~70% inhibition was 
observed, whereas at the highest concentration tested (15 mM), HS— 
and nitrite inhibited the uptake of formate by 90% and 70%, respectively. 

Such direct competitive inhibition (Fig. 2c), coupled with whole- 
cell transport assays (Fig. 2a) and in vitro binding assays (Fig. 2b), 
collectively indicates that the FNT3 protein from C. difficile is likely 
to be a channel for the hydrosulphide ion, HS. Although the channel 
is also permeable to formate and nitrite ions, the FNT3 gene is directly 
linked to the asrABC sulphite reductase operon (Fig. 1b); we have 
therefore tentatively named the FNT3 protein a hydrosulphide ion 
channel (HSC). A definite understanding of the physiological role of 
these channels will need to be confirmed by in vivo experiments such 
as genetic deletion studies. 

To understand the transport mechanism of the HSC protein from 
C. difficile (Supplementary Fig. 5), we determined its crystal structure 
at high pH (pH 9.0) toa resolution of 2.2 A (Fig. 3a and Supplementary 
Table 2). The HSC adopts the aquaporin/FocA fold’'” and forms a 
pentamer like FocA, with the five protomers having the same structure 
(Supplementary Figs 6-8). The two-fold inverted symmetry between 
the two halves of the protein is greater than that observed in FocA and 
extends to the pore-lining transmembrane helices TM2b and TM5b 
(Supplementary Fig. 8b), two salt-bridge triads on the periplasmic and 
the cytoplasmic sides (Lys 148-Glu 200-Asn 205 and Lys 16-Glu 81- 
Asn 86) (Fig. 4a—c), and the two short helices parallel to the membrane 
(helix P on the periplasmic surface and helix N at the amino terminus 
on the cytoplasmic surface) (Supplementary Fig. 8b). This helix N, by 
means of the Lys 16-Glu 81-Asn 86 salt-bridge triad, buttresses TM2b 
and TM65a, making the cytoplasmic side of the protein more closed and 
rigid than that in FocA”. 
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Figure 3 | Structural and functional characterization of the ion permeation 
pathway. a, Structure of a protomer of HSC overlaid with the pore diameter 
calculations made using HOLE, coloured to indicate permeability to water: 
green, permeable; red, impermeable. Transmembrane helix 2 has been removed 
for clarity. b, Pore diameter calculations from HOLE of the crystal structures for 
each of the two permeation pathway mutations, Leu 82 Val and Phe 194 Ile and 
wild type (WT). c, Close-up of the electron density observed at the selectivity 
filter for the Leu 82 Val mutant. The 2F, — F- electron density map contoured 
at 1.1 is coloured purple and the F, — F. difference density map contoured at 
30 is coloured green. d, Relative uptake of proteoliposomes reconstituted with 
purified HSC and pathway mutations. The graph shows the 10-min time point 
of each concentrative uptake experiment. Error bars, s.e.m. (N = 3). 


The pore for ion permeation, which is located at the centre of each 
protomer, contains the selectivity filter formed by a cytoplasmic slit 
and a periplasmic ring in the middle of the membrane (Fig. 3a, b). The 
radius of the cytoplasmic slit is 0.8 A, which is too small for an HS~ ion 
(radius, 1.7 A; ref. 24) to pass through (Supplementary Figs 9-11). 
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Figure 4 | Structural and functional characterization of possible gating 
mechanisms. a, Structure of the HSC protomer representing possible gating 
regions. The structure is coloured to show the two-fold inverted topology. 

b, c, The Glu-Lys-Asn salt-bridge triads, related by a pseudo-two-fold 
symmetry, help to stabilize helix P at the periplasmic side (b) and helix N at the 
cytoplasmic side (c) of the protein. The rotamer that each residue adopts is 
conserved. d, Relative uptake of proteoliposomes reconstituted with purified 
HSC and gating mutations. The graph shows the 10-min time point of each 
concentrative uptake experiment. Error bars, s.e.m. (N = 3). 
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Therefore, the HSC protein in the crystal structure is probably in a 
closed state. As expected from the sequence similarity between HSC 
and FocA in the selectivity filter (Supplementary Fig. 2), the positions 
or orientations of key residues changed only slightly in HSC 
(Supplementary Fig. 10). Such conservation explains the permeability 
of HSC to HS’, as well as to formate and nitrite. When a residue at the 
cytoplasmic slit or the periplasmic ring is mutated into a smaller 
residue, for example Leu 82 to Val or Phe 194 to Ile, the pore size is 
increased, as shown in the crystal structures of the mutant protein 
(Fig. 3b and Supplementary Table 2), and the formate transport rate 
of the channel is significantly increased (Fig. 3d). We note that in the 
crystal structure of the Leu82 Val mutant, electron density was 
observed in the cytoplasmic slit (Fig. 3c). This density, which is greater 
than that of a water molecule—as inferred from a B-factor fourfold 
lower than those of the surrounding atoms—could be accounted for 
by a Cl or an HS” ion. Owing to their similar sizes and chemical 
properties, it has been previously observed that Cl” and HS ions 
often bind to the same site in protein crystal structures”. 

The observed polyspecificity of HSC is typical for anion channels 
and is most probably shared by the FocA and NirC channels as well. 
Such polyspecific anion recognition is at least partly due to the diffuse 
nature of the electron cloud of anions”. By contrast, cation channels 
often show a high degree of selectivity, which is made possible by 
the point-like charge properties of such ions. Because different FNT 
channels in this ancient protein family are coupled with their specific 
metabolic pathways, each channel is expected to have its own preference 
for permeating ions, and that preference is probably due to the small 
structural variations in the selectivity filter that we observed here (Sup- 
plementary Fig. 10). 

The expulsion of toxic concentrations of HS” from the cytoplasm 
through HSC needs to be balanced against the potential risk of the 
uncoupling of the membrane proton gradient. Freely diffusible H,S 
and other weak, short-chain acids will dissociate into an anion and a 
proton in the cytoplasm (Fig. 1a). As the anion is expelled, there will be 
an intracellular net accumulation of protons. Therefore, the ground 
state of HSC is likely to be closed and its opening tightly regulated. 

Indeed, the HSC structure we determined at high pH is in its closed 
state (Fig. 3 and Supplementary Figs 9-11). We then considered how 
easily the channel could be opened. Because the homologous FocA 
channel can be gated by an appropriate change in pH or cytoplasmic 
formate concentration’, we first investigated possible pH-dependent 
structural changes of HSC by determining its crystal structure at pH 7.5 
(resolution, 3.2 A) and at pH 4.5 (3.0 A) (Supplementary Table 2). The 
protein structures obtained at these neutral and low pH values were the 
same as the structure at 2.2 A and pH 9.0 (Supplementary Fig. 12), and 
the channel remained closed. We then studied the two salt-bridge 
triads, both of which contain a glutamate residue that could potentially 
function as a pH sensor (Fig. 4a—c). When either of the salt bridges was 
broken, by mutating Lys 16 to Ser or Lys 148 to Glu, the HSC crystal 
structure did not change and the channel still remained closed 
(Supplementary Fig. 13 and Supplementary Table 2). Consistent with 
these structural results, the transport rates of these mutants did not 
increase (Fig. 4d). 

To determine whether HSC is gated by anion concentration in a 
similar manner to V. cholerae FocA"', in which the competition of 
formate with Thr90 from the Q-loop for hydrogen bonding to 
His 208 in the selectivity filter opens the channel, we mutated the 
equivalent threonine in HSC, Thr 84, to an Ala and determined its 
crystal structure at 2.4 A (Supplementary Table 2). This mutant struc- 
ture did not differ from the closed wild-type channel, and no increase 
in transport was detected (Fig. 4d and Supplementary Fig. 14). Because 
Thr 84, Lys 16 and Lys 148 all line the ion permeation pathway, muta- 
tion of these residues slightly reduced the ion permeation rate of the 
channel. Therefore, we conclude that the opening of HSC is indeed 
tightly regulated and the channel has a much lower open probability 
than FocA'"®, 
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The low open probability of HSC observed here is consistent with its 
physiological role in the cell. The opening of HSC probably requires 
significant movement of TM2b. As TM2b is ‘held’ in space by the 
cytoplasmic helix N (Fig. 4a), the movement of the latter, for example 
by direct interaction with a cytoplasmic enzyme in the same metabolic 
pathway, can trigger the opening of the channel. We have no evidence 
for such a hypothesis, and it can be validated only by further experi- 
mentation, which we expect would reveal greater variations in both the 
gating mechanism and the anionic preference of FNT channels. 


METHODS SUMMARY 

Bismuth sulphite agar plate assay. FNT3 was isolated from C. difficile strain 630. 
Escherichia coli BL21(DE3) transformed with vector pCDFduet-1-asrA-asrB, 
pACYCduet-1-asrC, or pACYCduet-1-asrC-FNT3 were plated on bismuth sulphite 
agar’? and allowed to develop for 48 h. 

Whole-cell in vivo transport. Salmonella typhimurium transformed with pBAD- 
FNT3 were assayed for their ability to generate sulphide from either thiosulphate 
or sulphite using a silver-sulphide probe following modification of a previously 
published protocol’. 

Proteoliposome reconstitution and in vitro transport assay. Proteoliposomes 
were formed from E. coli total lipids and Ni**-NTA-purified proteins and assayed 
by a method modified from a previously published concentrative uptake assay'!”’. 
Protein-ion interaction assay. lon-induced thermostability of HSC was monitored 
using analytical size-exclusion chromatography. Purified protein was incubated at 
increasing temperatures and its recovery was monitored by size-exclusion chromato- 
graphy using a high-performance liquid chromatograph. Candidate permeating 
ions were incubated at a concentration of 20 mM and protein was monitored for 
thermostabilization. 

Protein purification and crystallization and structure determination. HSC was 
expressed as a carboxy-terminal FLAG-10 His fusion in E. coli BL21. The protein 
was purified using Ni**-NTA affinity chromatography, followed by size-exclusion 
chromatography. The protein was crystallized by hanging-drop vapour diffusion 
and the crystal structure was solved by molecular replacement. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Phylogenetic analysis. Phylogenetic data for 474 bacterial and archaeal members 
of the FNT family were obtained from the HOGENOM database”, aligned using 
the program JALVIEW” and plotted using the program ARCHAEOPTERYX”. 
Genomic sequences and operon annotations were derived from the NCBI 
GenBank nucleotide database”. 

Bismuth sulphite agar plate assay. Gene FNT3 was isolated from C. difficile strain 
630. The genes for C. difficile asrA (NCBI gene ID, 4915353) and asrB (4915352) 
were cloned into pCDFDuet-1 (EMD Biosciences) and the genes for C. difficile 
asrC (4915351) and FNT3 (4915350) were cloned into pACYCDuet-1 (EMD 
Biosciences). Bismuth sulphite agar plates” were freshly prepared following the 
manufacturer's protocol and supplemented with 1mM _ isopropyl f-p-1- 
thiogalactopyranoside and the antibiotics chloramphenical and spectinomycin. 
E. coli BL21(DE3) cells were transformed with the indicated vectors and streaked, 
and colonies were allowed to grow at 37 °C for 48 h. The bismuth sulphite agar” 
contained the ATP synthase inhibitor brilliant green’” and high concentrations of 
SO; 7. When C. difficile asrABC was induced, E. coli were able to overcome the 
growth inhibition by producing ATP through glycolysis by reducing SO; 7 to 
H.S. 

Whole-cell in vivo transport assay. Salmonella typhimurium LT2 was trans- 
formed with the FNT3 cloned into a pBAD vector (Invitrogen). Cells were adapted 
in MOPS medium supplemented with 10 mM glucose and thiamine at 1 1g ml *. 
A fraction of each culture (5 ml) was centrifuged and resuspended in fresh media 
supplemented with ampicillin, 0.2% arabinose and either 6 mM sodium sulphite 
or 6mM sodium thiosulphate. Aliquots were incubated at 37 °C for 6h and then 
centrifuged to pellet bacteria. Medium was gently poured into an equal volume of 
sulphide antioxidant buffer. Exported sulphide was measured using a combination 
silver sulphide electrode (Cole-Palmer). The concentration of sulphide in the 
medium was determined using standards of known sodium sulphide concentra- 
tions in the same antioxidant buffer. 

Protein purification. FNT3 was cloned into a modified pBAD vector (Invitrogen), 
creating a C-terminal TEV-FLAG-10* His fusion protein, and transformed into 
E. coli BL21 plysS (Sigma-Aldrich) cells''’**°. Transformed cells were grown at 
37 °C to Doo nm 1-0 and induced with 0.2% of arabinose for 4 h. Cells were collected 
and lysed by passing twice through an Emulsiflex cell disrupter (Avestin). 
Lysate was clarified by a low-speed spin at 12,000g and solubilized with 1% 
B-dodecyl-maltopyranoside (DDM, Anatrace). Solubilized lysate was incubated 
with Ni’?-NTA resin (Qiagen) and the bound protein was eluted with buffer 
containing 1.1% f-octyl-glucopyranoside (OG, Anatrace), 50mM Tris 8.0, 
200mM NaCl, 300mM imidazole and 10% glycerol. The protein sample was 
incubated with TEV protease overnight to remove the tags and then underwent 
size-exclusion chromatography in a Superdex 200 in 25 mM Tris, pH 6.8, 100 mM 
NaCl and 1.1% OG. 

Mass spectrometry. The mass of purified HSC was determined using matrix- 
assisted laser desorption/ionization time-of-flight mass spectrometry in the 
laboratory of Dr T. Neubert following published protocols****. 

Protein-ion interaction assay. Ion-binding-induced thermostability of purified 
protein samples was measured using size-exclusion chromatography. Aliquots of 
100 mg of Ni*?-NTA-purified protein were incubated for 10 min ina thermocycler 
at increasing temperatures. Samples were injected onto a Shodex KW804 analytical 
size-exclusion chromatography column (Thomson) on a high-performance liquid 
chromatograph (Shimadzu) equilibrated in 200 mM Na2SOu,, 50 mM Tris, pH 8.0, 
3 mM NaN; and 0.05% DDM. The height of the monodisperse peak was used to 
quantify the amount of remaining sample. The nominal melting temperature, T,,, 
defined as the temperature at which 50% of the protein remained soluble, was 
calculated by fitting the data to a Boltzmann sigmoidal model in PRISM5 
(GraphPad Software). For screening of various compounds, 20 mM of each sodium 
salt was added before incubation. 

Proteoliposome reconstitution and in vitro transport assay. Transport of 
radiolabelled formate was measured in a concentrative uptake assay'!’. 
Escherichia coli total lipids were aliquoted in chloroform and dried under nitrogen. 
Following resuspension in intraliposomal buffer (150 mM sodium formate and 
50 mM Tris, pH 8.0), lipids were overlayed with nitrogen, freeze-thawed ten times, 


vortexed and sonicated to homogeneity. Proteoliposomes were formed by the 
addition of 1% OG and Ni*?-NTA-purified HSC at a protein-to-lipid ratio of 
1:10,000 (w/w). OG detergent was removed by incubation with 400 mg ml Bio- 
Beads SM2 (Bio-Rad) overnight. Proteoliposomes were then extruded through a 
400-nm membrane filter and frozen at —20 °C until use. Concentrated re-uptake 
of formate was initiated by buffer-exchanging 200 ml of thawed proteoliposomes 
over 2 ml of G50 Sephadex swollen in extraliposomal buffer (150 mM glutamate, 
50mM Tris, pH 8.0). Carbon-14-labelled sodium formate (450 1M, American 
Radiolabelled Chemical) was added to the collected proteoliposomes, and the 
assays were terminated by centrifuging a 20-ml sample at 1,000g for 1 min at each 
time point through a G-50 Probequant Micro-column (GE Healthcare) to remove 
the external radiolabelled formate. The amount of transported radiolabelled 
formate was quantified by adding 1 ml of scintillation fluid to each sample and 
emitted photons were counted using a Wallac scintillation counter. For competi- 
tion assays, increasing concentrations of sodium sulphite, sodium nitrite or 
sodium hydrosulphide (0, 0.15, 0.6, 3 and 15 mM), were added before the addition 
of radiolabelled formate. 

Protein crystallization. High-pH, primitive orthorhombic crystals of HSC were 
grown using the vapour diffusion method, by mixing protein at a concentration of 
11mgml ’ in a 1:1 ratio with reservoir solution containing 25-27% (v/v) PEG 
400, 100mM Tris, pH 7.0-9.0, and 100mM sodium sulphite, with or without 
50 mM sodium hydrosulphide. Medium-pH, monoclinic crystals were grown in 
reservoir solution containing 25-27% (v/v) PEG 400, 100 mM Tris, pH 7.5, and 
100 mM sodium nitrite. Low-pH, C-centred orthorhombic crystals were grown in 
buffer containing 100 mM sodium sulphite, 100 mM sodium nitrite, 8-10% (w/v) 
PEG 6000, 100 mM zinc acetate and 100 mM sodium acetate, pH 4.5. 

Structure determination. X-ray diffraction data were collected at beamlines X25 
and X29 of the National Synchrotron Light Source at Brookhaven National 
Laboratory and beamline 23ID of the Advanced Photon Source at Argonne 
National Laboratory. Images were processed and scaled using the HKL2000 
suite’. The structure was solved by molecular replacement using the V. cholerae 
FocA structure! (PDB code: 3KLY) as a search model. Models were built using 
PHENIX Autobuild* and manually adjusted using COOT”. Refinement was 
carried out using PHENIX Refine*. The pore size of the channel was calculated 
using HOLE”, and structure figures were generated using PYMOL”. 
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CORRIGENDUM 
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Corrigendum: Functional 
complementation between FADD 
and RIP1 in embryos and 
lymphocytes 


Haibing Zhang, Xiaohui Zhou, Thomas McQuade, Jinghe Li, 
Francis Ka-Ming Chan & Jianke Zhang 


Nature 471, 373-376 (2011) 


In Fig. 3b of this Letter, the plot for the NP205 epitope response in the 
Fadd ‘~ Rip1‘~ double-knockout (DKO) splenocytes was a duplicate 
of that for the NP38 epitope response. The correct figure panel is 
shown below. The panel has been corrected in the HTML and PDF 
versions online. 


NP205 


IFNy ——> 


CD44. ———> 


Figure 1 | This is the correct panel from Fig. 3b of the original Letter. 
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Postdoc or not? 


Life-sciences graduates interested in academic research 
typically need to do at least one postdoc. For physics 
students, there are multiple caveats to consider. 


BY KAREN KAPLAN 


jorn Flatt had always assumed that he 

would become an academic physicist. 

After earning his PhD in particle physics 
from Johannes Gutenberg University Mainz in 
Germersheim, Germany, he was happy to land 
a postdoctoral fellowship at Stanford Univer- 
sity in Palo Alto, California. 


But things deteriorated rapidly. The two- 
year fellowship was extended by a further 
two years so that Flatt could complete time- 
intensive experiments, but the delay hampered 
his efforts to break into the permanent-job 
market. By the time he started looking for an 
academic position in 2009, the effect of the 
recession was already widespread. He received 
no interview offers. 


Flatt came to realize that he did not want 
to pursue an academic career after all, and set 
his sights on industry. However, he could not 
find a permanent research post, and ended 
up starting another postdoc, this time in mass 
spectrometry and instrumentation physics at 
Agilent Technologies in Santa Clara, California, 
which was at least a way to get his foot in the 
door. “It wasn’t exactly my field, but my theses 
had been on instrumentation development, 
Flatt says. “It was actually a pretty good match” 
Last March, several months before the postdoc 
was due to end, Flatt heard about an open- 
ing in the company for a senior research and 
development scientist, a permanent position. 
Flatt applied for it and got the job. 

Today, after a year in his new post, Flatt says 
that his two postdocs offered little value other 
than confirming that he did, in fact, want to 
pursue industrial research. They were neither 
crucial for building his research portfolio nor 
obligatory for landing his permanent position 
at Agilent. 

Science postgraduates, especially those in 
the biological sciences, often see postdocs asa 
way to continue and refine their research, learn 
to run a laboratory and develop a broad, deep 
collaborative network. Flatt is one of many 
early-career physicists in the United States who 
dismiss that idea. Although researchers in the 
biological sciences may have to take multiple 
postdocs before landing their first permanent 
post, the landscape is shifting for physicists. 
Nearly 70% of physicists who earned their 
degree in 2004 took a postdoc, but the propor- 
tion had fallen to 56% in the classes of 2007 
and 2008, according to the Statistical Research 
Center of American Institute of Physics (AIP) 
in College Park, Maryland. 

One reason for the decline is that fewer 
people with physics PhDs are pursuing an 
academic research career. This is in part 
because there are, and have been, fewer posi- 
tions to pick from. Academic hiring has been 
flat since 2003, says Crystal Bailey, the educa- 
tion and careers programme manager for the 
American Physical Society (APS), also in Col- 
lege Park. Between 2006 and 2007, about 350 
faculty members left their job in physics, but 
in 2008, about 450 people with physics PhDs 
were seeking positions. Of those with physics 
PhDs in the United States from the classes 
of 2007 and 2008 who did not take a post- 
doc directly after earning their degree, 62% 
accepted private-sector positions, according 
to the AIP Statistical Research Center. Another 
10% accepted government positions, such > 
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> asat one of the US Department of Energy’s 
21 labs and technology centres. About one- 
quarter accepted academic posts. 

Few private-sector or industry positions 
require an applicant to have completed a post- 
doc — even if those posts involve research. 
Bailey says that she rarely advises physics 
graduate students to pursue a postdoc, unless 
they are certain they want a career in academia 
or they need a job while they consider their 
options. Physics postgraduates are, she says, far 
more likely to get full-time permanent employ- 
ment in the private sector, for which a postdoc 
is largely irrelevant. 

About 12 industrial employers in the United 
States are currently advertising jobs for people 
with physics PhDs on a members-only section 
of the APS website, but none of them lists a 
postdoc as a prerequisite. 

The applied-research nature of physics- 
related professions often makes a stint in 
academia less than useful. “A lot of physicists 
are just as comfortable doing engineering as 
engineers are,’ says James Duderstadt, presi- 
dent emeritus at the University of Michigan 
in Ann Arbor. “If they go to work for IBM, 
they’re going to be doing applied research. If 
they work at the Large Hadron Collider, they’re 
going to be doing applied physics,’ says Dud- 
erstadt, who was also a member of a National 
Academies panel that, in 2000, analysed the 
US postdoctoral experience. “Overall, it is not 
necessary, unless they have some experimental 
thing they need to learn in someone's lab that 
they can't learn ina PhD? 

A postdoc could even be a drawback for 
physicists considering the private sector. 
Many of Europe’s industrial employers pre- 
fer to hire applicants who have a PhD anda 
‘clean slate, says Maria Allegrini, a physicist at 
University of Pisa in Italy and a member of the 
APS Committee on International Scientific 
Affairs. “They want to train you in their own 
field? she says. 

Physicists with experience in US industry 
warn that a CV with three or more academic 
postdocs can create the impression that the 
applicant had hoped 


for a career in aca- he applied- 
demia and does not esearch 

really want one in mature of 
industry. “We wantto physics-related 
find someone whos professions often 
charged up with makes postdocs 


enthusiasm to work less thanuseful.” 
for our company,” 
says Jim Hollenhorst, Agilent’s senior direc- 
tor of technology, who regularly hires physics 
postgraduates. “So dont do three postdocs, not 
find an academic job, and then apply here.” 
Having multiple industrial postdocs 
could also be detrimental in some cases. If 
a researcher completes a two- or three-year 
postdoc at a company — requiring a sig- 
nificant training commitment from a senior 
researcher — and then leaves to do the same 


thing somewhere else, he or she may be viewed 
negatively by subsequent industrial hiring 
committees. But the outlook is also employer- 
and location-dependent, notes Dipali Bhatt- 
Chauhan, international-relations manager 
at the Institute of Physics in London. British 
industrial employers often view candidates 


Anne Goodsell says a postdoc can help young 
physicists expand their network and publish more. 


with several industrial postdocs as having 
greater skills and knowledge than those who 
have done one or none, she says. “Employers 
would be more likely to hire that postdoc.” 


SMART PURSUIT 

The requirements are different for those 
seeking a government or academic research 
post, including at US liberal-arts colleges. 
Catherine Crouch, a materials physicist at 
Swarthmore College in Pennsylvania, com- 
pleted two postdocs at Harvard University 
in Cambridge, Massachusetts, where she 
earned her PhD. In the first, she focused 
on physics-education research, with some 
teaching duties; in the second, she special- 
ized in experimental research in a liberal-arts 
environment — important at Swarthmore, a 
liberal-arts college that heavily weights fac- 
ulty research. As well as honing her research, 
she says, the fellowships taught her how to 
run a lab independently — and signalled to 
potential employers that she was capable of 
such. “When academic hiring committees 
are reviewing job applications, they look for a 
postdoc as evidence that you've had this kind 
of responsibility — how to run a research pro- 
gramme, how to pick good projects, how to 
decide what equipment will serve you well,” 
she says. “There are all these meta-questions 
that don't necessarily come into your arena as 
a graduate student.” 
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A postdoc expands young physicists’ net- 
works far beyond what they can typically 
achieve as a doctoral student, says Eric Jensen, 
chair of Swarthmore’s physics and astronomy 
department. “It gives you the chance to build 
more collaborations and develop a network of 
colleagues at other institutions that you can 
be in touch with and work with,’ he says. 

Postdocs can also be useful for those who 
are uncertain of their pathway or who want 
to pursue academia but are out of sync with 
the academic year hiring cycle. Anne Good- 
sell, an atomic and optical physicist at Mid- 
dlebury College in Vermont, says that people 
who earn their PhD around June, for example, 
and are aiming for a career in academia, can 
generally apply for posts starting only dur- 
ing the following semester. In the meantime, 
she says, such postgraduates, as well as those 
who are not sure of their direction, can take a 
postdoc to maintain existing contacts, estab- 
lish new ones, publish more, gain expertise in 
a subfield and sustain research momentum 
while waiting to apply or deciding what path 
to take. “It makes a lot of sense to take a post- 
doc to fill that gap,” Goodsell says. 

A postdoctoral fellowship at federal labs 
in Europe and the United States may some- 
times lead directly to a full-time permanent 
position. Mary Anne With, a postdoctoral 
adviser at Los Alamos National Laboratory 
in New Mexico, says that 80-90% of the lab’s 
technical positions are filled by former post- 
docs. Joseph Bernstein, an astrophysicist and 
postdoc at Argonne National Laboratory in 
Illinois, who is on his second renewal and 
fourth year, is considering either a consulting 
position or a third extension of his postdoc. 
The extension, he says, would be equivalent 
to a tenure-track assistant-professor post 
and is likely to lead to a permanent position 
at Argonne — and it pays well for a postdoc 
position. He declined an offer a couple of 
years ago for an assistant-professor position 
that would have meant taking a 10% pay cut. 
By extending his postdoc, Bernstein believes 
he will develop the technical computing and 
programming expertise necessary to become 
staff. “It’s a good opportunity,’ he says. 

So how does a student decide whether to 
pursue the postdoc path? If postgraduates are 
planning for a career in academia, it is still 
a requirement, acknowledges Bailey. But for 
those who remain uncertain, doing one post- 
doc is unlikely to damage anyone’s budding 
career, even if they decide to head for indus- 
try, says APS executive officer Kate Kirby. 
“Tt increases the number of connections you 
have in your field, she says. “It makes it pos- 
sible to get further letters of recommendation 
from a broader constituency — a tremendous 
advantage that can help you collaborate, get a 
grant or get a position.” m 


Karen Kaplan is assistant Careers editor at 
Nature. 
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TURNING POINT 


Jessica Hellman 


Conservation biologist Jessica Hellman studies 
the effects of climate change at the University 
of Notre Dame in Indiana. She was inspired by 
her one-year fellowship from the Aldo Leopold 
Leadership Program in Stanford, California 

to do something that few scientific laboratories 
do — create a mission and vision statement 
for her research group. 


What is the best piece of career advice you 
have received? 

My PhD adviser was Paul Ehrlich, a popu- 
lation biologist at Stanford University, and 
a public face of science. I studied with him 
because I wanted to have something to say 
about natural-resource management or how 
to deal with climate change. He was an ideal 
role model. But what he told me, and what I 
tell my students now, is “it’s all about science”. 
Concrete data, experience and knowledge are 
what give you a voice in the public sphere. 
If you bring nothing substantial to the table, 
you are just another voice. 


You have a fear of being labelled a ‘butterfly’ 
person. Why? 

I examine how climate change affects endan- 
gered species — an issue that has an inher- 
ent complexity because of the dynamics of 
temperature effects and species interactions. 
One of the species I’ve studied is the endan- 
gered Bay Checkerspot butterfly (Euphy- 
dryas editha bayensis), which runs out of 
food when high temperatures or drought 
cause its host plant to dry out. I don’t want 
people to miss the point: I study butterflies 
not because I am more fascinated with them 
than with other creatures but because, with 
so much data available, they serve as a useful 
proxy for insects in general. But one thing I 
learned from my leadership fellowship is to 
be comfortable in my own skin and recognize 
what distinguishes me as a researcher — ’'m 
learning to embrace my inner butterfly. 


Why are you trying to ‘brand’ your research? 
There are two dimensions to branding: com- 
municating to the world who you are and 
what you are about, and helping you to decide 
what you want to be doing. I wanted to be the 
person to study how populations are locally 
adapted to climate, which my group does. 


Do you think every scientist should brand 
themselves? 

I think it is a worthwhile exercise. When we 
first hire a researcher, we want a solid idea of 
who they are going to be, and it is important 


for them to think strategically about how 
they can set themselves apart — so it is help- 
ful to develop a plan that makes that clear. 


Why did you decide to write a mission and 
vision statement for your laboratory? 

I wanted to know who we were as a group and 
what we should focus on. Once you get tenure, 
there are a million things to do — collabora- 
tions, committees and grant applications to 
write. I decided that one way to be thoughtful 
about our work, and what to work on, would 
be to collectively write our mission and vision. 
I found some instructions and group exercises 
aimed at businesses, and then we hada retreat 
to strategically discuss how we could match 
our strengths, priorities and goals for making 
our research relevant to larger issues. We'll put 
a copy in the lab for new students. 


Will you continue to have retreats? 

Yes. My research group should have regular 
retreats. We have lab meetings to talk about 
science, but retreats focus on broader topics, 
such as professional development. I want to 
brainstorm with everybody about what they 
can be best at. Retreats are also a great way 
to build morale. Everybody wants to be part 
of something. 


Why did you start blogging and tweeting? 

I want to make science relevant to the rest of 
the world, be it landowners or government 
agencies — but I want to do it with transpar- 
ency and honesty. Social media opens the 
shop doors a bit more. I would never tell stu- 
dents that their time is better spent on a blog 
than ona published paper, but it is a great way 
to be more open. m 
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WORK-LIFE BALANCE 
Stress survey 


Scientists’ work obligations conflict with 
their personal lives three or more times 
aweek, finds a poll of 4,225 scientists by 
the Association for Women in Science 
(AWIS) in Alexandria, Virginia. A third 
of respondents to the global work-life 
satisfaction survey said that achieving 
work-life balance damages their careers. 
“There's a lot of stress out there; it’s global 
and not gender-specific, says AWIS 
past-president Joan Herbers, who blames 
the dissatisfaction in part on rigid tenure 
timelines and requirements. Extending 
the tenure clock to allow for birth or 
adoption, and asking tenure committees 
to rate candidates on the basis of their 
publications’ impact rather than total 
number, would ease stress, says Herbers. 


ACADEMIA 
Increasing salaries 


Faculty members at US doctorate-granting 
institutions received an average 1.7% salary 
increase for 2011-2012, 41.7% higher 

than the average 1.2% the previous year, 
according toa survey of 224,693 faculty 
members at 813 four-year colleges and 
universities. The survey conducted by 

the College and University Professional 
Association for Human Resources in 
Knoxville, Tennessee, found that faculty 
members at public doctoral institutions 
received a 1.2% average pay rise for 
2011-2012, three times higher than the 
year before. Private institutions awarded a 
2.4% rise for 2011-2012, up from last year’s 
2.2%. Assistant professors in biology and 
biomedicine earned US$59,907 in 2011- 
2012, and those in physical sciences earned 
$58,771, up 2% and 2.3%, respectively. 


RESEARCH 
NASA to award grants 


As part of an inaugural space-technology 
research programme, NASA is seeking 
grant proposals from early-career 
scientists. Ten researchers in areas such as 
communication and navigation systems; 
materials, structures and mechanical 
systems; and human health, life support 
and habitation systems will receive up 

to US$200,000 per year for up to three 
years. The programme aims to improve 
its future space-exploration and discovery 
capabilities by focusing on early-stage 
research and seeding innovation, says a 
NASA spokesperson. Notices of intent 

to submit are due on 30 March and the 
deadline for final proposals is 3 May. 
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VISITING BOB 


BY JOHN GILBEY 


ne of the great advantages of being 
() a writer is the fascinating folk you 

meet. Take Bob, for example. I first 
talked to him in Palo Alto about eight years 
ago. We chatted, got on, shared some bizarre 
and unlikely anecdotes — but like so many 
chance encounters, I didn’t think 
much more about it. Then, out of 
the blue, I was invited to write Bob’s 
biography. Hed read my published 
stuff, liked it, and asked me to doc- 
ument his career in the Valley. To 
say I was pleased would be a major 
understatement. 

A week later I was ensconced 
in a wood-panelled corner office 
at Bob’s host institute, overlook- 
ing the coffee shops of University 
Avenue, with an expenses-paid 
contract to extract from Bob all his 
tales of the old days that were fit to 
print — a constraint that probably 
reduced the size of the book by half. 
It was one of the most enjoyable and 
intriguing jobs I’ve ever had. 

Bob and I hadn't talked for sev- 
eral months when the call came, 
and I felt a sinking feeling when I 
realized that the caller wasn’t Bob 
but Emily, one of the cloud of cra- 
zily bright postdocs that he employs 
as assistants. Like all of us, Bob wasnt getting 
any younger. 

“Hi John, I thought I'd better call you 
about Bob...” Here it comes, I thought. “He's 
got some fairly serious health problems, and 
he’s been asking for you.” 

I thought about this. “Can't you put him on 
the line?” There was a regretful, apologetic 
pause. “I’m afraid not, he can’t interpret low- 
bandwidth AV now. Would you be able to visit 
him?” I was still pondering when she added: 
“Just one thing, dont leave it too long...” 


After the ten-hour flight, what I really 
wanted was beer, food and sleep. At a pinch, 
I'd skip the food. Emily, however, was insist- 
ent that I talk to Bob first. Looking into 
his room from an observation window, it 
seemed more chaotic than I remembered, 
but it was the primal waft of organic decay 
— nauseating even from the next room — 
that shocked me. I glanced across at Emily, 
who gave a slow nod of the head. 

“Hi, Bob, how are you doing?” The 
response was slow, like someone waking 
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Life insurance. 


from an anaesthetic. “John? Is that you?” The 
voice was stretched and blurred, but the sen- 
sory unit rotated and waddled towards the 
window like an inebriated desk lamp. 

“T really appreciate you coming over,” 
muttered Bob. “As you can see, things aren't 
going too well... I don’t know how much 
Emily has told you, but it looks like I only 


have a month or so left. I’ve already lost 
some peripheral stuff — that’s why I can’t use 
remote comms — but it is creeping into my 
long-term memory and... heck, I’m scared. 

“[need you to buy me afew more years... 
Just a few...” His voice faded. “Sorry, ?m 
going to have to rest for a while now — just 
follow the money...” When I looked round, 
I found that Emily had tears in her eyes — 
which made two of us. 

“What does he need?” I asked Emily. 
She sighed and referred to an internal list 
—a long one. Many of Bob’s systems were 
well beyond their technical ‘end of life’: the 
organic wet-ware circuits that formed most 
of his unique design were literally rotting, and 
had only ever been made as research tools, 
not production systems. No one seemed to 
know whether the technology could be rep- 
licated with the skills now available — or even 

if it would be legal — but 


DNATURECOM there seemed no way 
FollowFutureson = that Bob’s conscious- 
Facebook at: ness could be ported to 
go.nature.com/mtoodm © another infrastructure. 
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We found a bar, drank several beers, and 
tried to sketch out a budget for a rebuild. The 
numbers were astronomical, and it was clear 
from Emily’s terse analysis that all attempts 
to gather funding through the usual channels 
had failed. 

Sleep wasn't an option. As headlights 
panned across the ceiling of my motel room 
I fought to find a solution. “Follow 
the money,’ Bob had said. Bob — the 
‘Bucket of Brains’ they used to call 
him, much to his annoyance. There 
were no physical brains in the sys- 
tem. Instead, the reasoning power of 
20 top scientists of the day had been 
distilled into a semi-organic struc- 
ture that — to everyone's surprise 
— had fought schizophrenia and 
paranoia to emerge as the compos- 
ite personality of Bob. The first, and 
last, of his kind. “Follow the money.’ 
Easy to say, but why me? 


The coffee was so strong that my 
vision took on the character of a 
shaky film noir — but it fought the 
jet lag. Emily toyed with her green 
tea and tried not to look concerned. 
Iasked the critical question: “When 
I was here last, I left a set of note- 
book files — stuff that Bob wouldn't 
let me use in the book, or even have 
copies of. Are they still here?” 

Emily shrugged slightly. “We never throw 
anything away, that’s part of our culture” 

I smiled with relief, thinking of all the 
doors — and bank accounts — that were 
about to open. 

Emily looked puzzled. “Pll have to check? 
I qualified carefully, “but from memory, 
those notes cover every major scandal 
— personal, corporate and political — in 
the Valley between 2015 and 2035. Bob's 
component consciousnesses were deeply 
involved — more to the point, the other folk 
implicated are now in very senior posts and 
definitely won't want this stuff made pub- 
lic ...“Follow the money...” 

The mug of tea dropped to the table top. 
“You mean, he wants to blackmail people to 
raise the money?” Emily exclaimed in shock. 

“Tt certainly looks like it,” I replied with 
a smile. “I’m just beginning to realize how 
human Bob is.” m 


John Gilbey (gilbey@bcs.org.uk) is a writer 
who knows a thing or two about mysterious 
inner rottings — but not blackmail, honest... 
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Structure of the mitotic checkpoint 


complex 


William C. H. Chao, Kiran Kulkarni!*, Ziguo Zhang', Eric H. Kong! & David Barford! 


In mitosis, the spindle assembly checkpoint (SAC) ensures genome stability by delaying chromosome segregation until 
all sister chromatids have achieved bipolar attachment to the mitotic spindle. The SAC is imposed by the mitotic 
checkpoint complex (MCC), whose assembly is catalysed by unattached chromosomes and which binds and inhibits 
the anaphase-promoting complex/cyclosome (APC/C), the E3 ubiquitin ligase that initiates chromosome segregation. 
Here, using the crystal structure of Schizosaccharomyces pombe MCC (a complex of mitotic spindle assembly checkpoint 
proteins Mad2, Mad3 and APC/C co-activator protein Cdc20), we reveal the molecular basis of MCC-mediated APC/C 
inhibition and the regulation of MCC assembly. The MCC inhibits the APC/C by obstructing degron recognition sites on 
Cdc20 (the substrate recruitment subunit of the APC/C) and displacing Cdc20 to disrupt formation of a bipartite D-box 
receptor with the APC/C subunit Apcl0. Mad2, in the closed conformation (C-Mad2), stabilizes the complex by optimally 
positioning the Mad3 KEN-box degron to bind Cdc20. Mad3 and p31°°™* (also known as MAD2L1-binding protein) 
compete for the same C-Mad? interface, which explains how p31°°™ disrupts MCC assembly to antagonize the SAC. This 


study shows how APC/C inhibition is coupled to degron recognition by co-activators. 


The fidelity of chromosome separation in mitosis is governed by an 
evolutionarily conserved cell-cycle checkpoint mechanism called the 
SAC’. The SAC arrests the mitotically dividing cell to allow complete 
chromosome attachment to the bipolar mitotic spindle. The essence of 
the SAC is to block the onset of anaphase by inhibiting APC/C- 
mediated ubiquitin-dependent degradation of securin and mitotic 
cyclin. Components of the SAC that are responsible for detecting 
unattached kinetochores and for propagating signals to the APC/C 
have been identified**, but the molecular basis underlying these pro- 
cesses is only partially understood’*. Mad2 and Mad3 (BubR1 in 
metazoans) mediate APC/C inhibition through their association with 
its co-activator subunit Cdc20 (refs 5-12). Mad2, Mad3 and Cdc20 
(together with mitotic checkpoint protein Bub3) form the MCC that 
directly binds the APC/C to inhibit substrate recognition’*'*. Mad2 
and Mad3 cooperate to antagonize Cdc20-dependent activation of the 
APC/C”, with Mad3-Cdc20 interactions requiring the pre-assembly 
of a Cdc20-Mad2 complex'*'*®. Thus, SAC signalling occurs through 
the generation of the Cdc20-Mad2 complex, a process initiated by 
Mad1, which is the Mad2 receptor at unattached kinetochores. 
Central to the association of Mad2 with Cdc20 is the inter-conversion 
of Mad2 between the open (O-Mad2) and closed (C-Mad2) structural 
states”, These states of Mad2 differ in the topology of a carboxy- 
terminal B-sheet that repositions in C-Mad2 to enable binding to its 
protein ligands, Mad1 or Cdc20 (refs 20-22). In the template model for 
SAC activation, Mad1 interacts with C-Mad2, generating the C-Mad2- 
Mad1 complex that subsequently recruits O-Mad2 through the 
C-Mad2 dimerization interface. By inducing the conformational trans- 
ition of O-Mad2 to C-Mad2, the Mad1-bound C-Mad2 subunit cata- 
lyses the binding of Mad2 to Cdc20 (refs 17, 23). 

APC/C activity and its substrate recruitment are dependent on its co- 
activators (Cdc20 and Cdh1)”, which recognize APC/C substrates 
through two destruction motifs (degrons); the D box” and the KEN 
(Lys-Glu-Asn) box’®. Mad3 contains a KEN box that is essential for 
MCCassembly’*”’”*, suggesting that Mad3 may act as a pseudosubstrate 


to block substrate recognition by APC/C“°. However, other studies 


showing that the promotion of ubiquitin-mediated degradation of 
Cdc20 by the SAC is dependent on the KEN box of Mad3 (refs 18, 
27, 29) indicate that there is a more complex mechanism controlling 
APC/C“° activity. The mechanisms underlying APC/C activation 
after SAC silencing are also poorly understood. In metazoans, p31°™* 
antagonizes the SAC” by functioning as a structural mimic of Mad2 
that binds at the Mad2 dimerization interface to inhibit the conforma- 
tional activation of O-Mad2 (ref. 31). UbcH10, assisted by p31°°™, 
catalyses Cdc20 ubiquitination, which leads to the disassembly of 
the MCC”. 

To understand the molecular mechanisms underlying the mitotic 
checkpoint complex, we determined the crystal structure of the fission 
yeast MCC. The structure shows how Mad2 and Mad3 cooperatively 
inhibit Cdc20, and indicates how p31°™ would antagonize MCC 
assembly. The structure of Cdc20 in the context of the MCC offers the 
first opportunity to visualize degron recognition by co-activators. The 
interaction between Mad2 and Mad3 positions the Mad3 KEN box 
towards the KEN-box receptor of the Cdc20 WD40 domain. 
Additionally, an unexpected D-box mimic of the Mad3 C terminus 
reveals the D-box-binding site on Cdc20, thus demonstrating the 
structural basis of D-box recognition by co-activators. 


Overall structure of the MCC 

We generated the fission yeast MCC by co-expressing Cdc20, Mad2 
and Mad3 in insect cells. The complex comprises Cdc20 with all 
functional domains (C box, Mad2-binding motif, WD40 domain 
and Ile-Arg tail (Supplementary Fig. 1)), Mad2 locked in its closed 
conformation that facilitates binding to Cdc20 (ref. 31), and Mad3 
truncated after the tetratricopeptide repeat (TPR) domain*’ and thus 
lacking its C-terminal KEN box. Bub3 was omitted from the complex 
because previous studies indicated that Bub3 was not an integral part 
of MCC in fission yeast** and was not required for MCC-mediated 
inhibition of human APC/C™. 


1Division of Structural Biology, Institute of Cancer Research, Chester Beatty Laboratories, 237 Fulham Road, London, SW3 6JB, UK. 


*These authors contributed equally to this work. 
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The crystal structure of the MCC, at 2.3 A resolution, shows that 
Cdc20, Mad2 and Mad3 assemble into a triangular heterotrimer 
(Fig. 1 and Supplementary Table 1). Mad3 coordinates the overall 
organization of the complex by forming numerous inter-subunit 
interactions with Mad2 and Cdc20, whereas Cdc20 and Mad2 interact 
primarily through the sequestering of the Mad2-binding motif of 
Cdc20 by the Mad2 ‘safety belt’ (refs 21, 22). The core architecture 
of Mad3 is a contiguous TPR superhelix with three TPR motifs 
flanked by capping «-helices that resembles the BubR1 amino ter- 
minus*’. In addition, not present in the BubR1 structure, the con- 
served N terminus of Mad3 that incorporates the N-terminal KEN 
box that is essential for MCC assembly’*?””* adopts a helix—loop- 
helix (HLH) motif (Fig. 1 and Supplementary Fig. 2). The HLH motif 
simultaneously binds Mad2 and Cdc20, orienting the KEN box 
towards its receptor on Cdc20 (Fig. la, b). Mad3 also contacts 
Mad2 and Cdc20 through its TPR domain (Fig. 1a, c). 

The WD40 domain of Cdc20 conforms to a canonical seven-bladed 
B-propeller (Fig. 1a). The Mad2-binding motif, N-terminal to the 
WD40 domain, is structurally well defined, engaging the safety 
belt”'”* of Mad2 and adopting a conformation that is similar to the 
Mad2-binding motif of Mad1 bound to Mad2 (ref. 22) (Fig. 2a). The 
linker connecting the Mad2-binding motif with the N terminus of the 
WD40 domain is disordered, as are the two APC/C interacting motifs: 
aconserved C box that is immediately N-terminal to the Mad2-binding 
motif, and the C-terminal Ile-Arg tail that interacts with the TPR 
subunit Cdc27°° (Supplementary Fig. 1). 

Mad? is an «/B-HORMA-class protein” (Fig. 1a and Supplemen- 
tary Fig. 3). Its C-Mad2 conformation enables interactions with the 


Mad3 


) Mad3 HLH 
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Mad2-binding motifs of Cdc20 and Mad1 and, as revealed in the 
MCC crystal structure, is the only state of Mad2 that can recognize 
Mad3. Mad2 interacts through its a-C helix and B8’-B8"’ hairpin 
with the HLH motif of Mad3 (Fig. 2b). Notably, such Mad2 inter- 
actions resemble those between C-Mad2 and p31°™* (ref. 31.) 
(Fig. 2c), and C-Mad2 in an asymmetric C-Mad2-O-Mad2 dimer’ 
(Fig. 2d). The involvement of the B8’-B8’’ hairpin—a region of Mad2 
that undergoes substantial conformational change upon transition 
from the open to closed states*'—at the Mad2-—Mad3 interface indi- 
cates that Mad3 binds exclusively to C-Mad2. 

Mutations of the Mad2 «-C helix disrupt C-Mad2-Mad3 inter- 
actions*’, consistent with our structure. To test the Mad3 interface, 
we replaced Met 13 with Arg (Fig. 2b). This mutation dissociated 
Mad3 from a Cdc20-C-Mad2 heterodimer when size-exclusion 
chromatography was performed (Supplementary Fig. 4). Together, 
these data confirm the physiological relevance of the C-Mad2-Mad3 
interface and show that C-Mad2 is required to confer high-affinity 
binding of Mad3 to Cdc20, consistent with studies showing that 
Mad3 association with Cdc20 in vivo is synergistic with Mad2 (refs 
12, 15, 16). 


The Cdc20 WD40 domain is a receptor for KEN and D box 


We identified two highly conserved surfaces on the Cdc20 WD40 
domain, and these are responsible for APC/C degron recognition: 
the KEN-box” receptor, situated on the top side of the WD40 domain 
at the centre of the B-propeller (Fig. 3a), and the D-box”* co-receptor 
lying in a channel between blades 1 and 7 (Fig. 4a). The KEN-box 
residues of Mad3 (Lys 20, Glu 21 and Asn 22) emerge from the C 


Figure 1 | Structure of S. pombe MCC trimer. 
a, Cartoon representation of the complex of Mad2 
(green), Mad3 (cyan) and Cdc20 (yellow). Mad2 is 
in the C-Mad2 conformation. The KEN box is 
shown in red, located in the HLH motif of Mad3. 
The D-box mimic (magenta) that is bound to 
Cdc20 is from the C terminus of Mad3 from a 
symmetry-related molecule. The Mad2 B8’-B8"’ 
hairpin (dark green) that forms the Mad2—-Mad3 
interface, repositions on conversion from O-Mad2 
to C-Mad2. The N terminus of the WD40 domain 
is indicated. b, Details of the Mad3 HLH 
interaction with Mad2 and Cdc20. c, Surface 
representation of the MCC. The interaction 
between Mad2 and Mad3 positions the Mad3 KEN 
box at the KEN-box receptor, which is located at 
the centre of the top side of Cdc20’s WD40 domain. 
MB, Mad2-binding. 
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terminus of the o-A helix of the HLH as an underwound turn of 
a-helix (Figs 1 and 3b). Their conformation resembles a ‘U’, with 
Glu 21 at the turn dipping into a depression at the centre of the 
Cdc20 B-propeller toroid. Tyr 181 of Cdc20, situated on an extended 
loop connecting blade 1 with blade 7, acts as a platform to support the 
aliphatic moiety of the Glu 21 side chain. Apart from this single 
hydrophobic interaction, the KEN box forms entirely polar contacts 
involving its side-chain and main-chain groups with conserved polar 
and charged residues of loops on four blades of the Cdc20 B-propeller. 
Notably, the five residues of Cdc20 that contribute side-chain 
interactions to the KEN box (Asp 180, Asn 326, Thr 368, Gln 392 
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Figure 2 | Details of Cdc20-Mad2-Mad3 
interactions. a, Mad2-binding motif of Cdc20 
bound to the Mad2 safety belt. b, A universal Mad2 
dimer interface contacts Mad3, p31°™ and 
O-Mad2. Mad2-Mad3 interactions. c, Mad2- 
p31°°"* (ref. 31) (Protein Data Bank (PDB) 
accession 2QYF). d, C-Mad2-O-Mad? (ref. 37) 
(PDB accession 2V64). 


Mad3 HLH 


and Arg 438) are invariant in both Cdc20 and Cdhl, indicating a 
universal mode of co-activator-KEN box interaction (Supplemen- 
tary Fig. 5). A proline (Pro 25) positioned three residues C-terminal 
to the Mad3 KEN box—well conserved in KEN-box motifs—acts to 
break the o-helix and orient the polypeptide chain away from the 
Cdc20 surface. 

The Mad3 KEN box has been proposed to function as a pseudo- 
substrate inhibitor, blocking access of Cdc20 to KEN-box degrons in 
APC/C substrates’*. Thus, the mode of binding of the Mad3 KEN box 
to Cdc20 should serve as a model for understanding how co-activators 
recognize APC/C substrates. Consistent with this idea, mutating the 


Figure 3 | The KEN box binds to a conserved 
surface at the centre of the top side of the WD40 
domain. a, Cdc20 shown with surface 
representation coloured according to the sequence 
conservation of Cdc20 and Cdh1 (yellow 
(invariant) to green (less conserved)) 
(Supplementary Fig. 5). Mad3 diagram with KEN 
box in red. b, Details of KEN-box interactions with 
the Cdc20 WD40 domain. c, Mutation of the KEN- 
box-binding site abolishes S. cerevisiae APC/C“*"'- 
mediated ubiquitination of securin as indicated by 
an in vitro transcription or translation (IVT)-based 
ubiquitination assay. KEN-box mutants: N405A, 
N407A, Q473A, R517L (S. pombe Cdc20: N324, 
N326, Q392, R438). 
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Figure 4 | The D box binds in an extended 
conformation to a conserved inter-blade channel 
on the Cdc20 WD40 domain. a, Cdc20 shown 
with surface representation coloured according to 
sequence conservation of Cdc20 and Cdh1 (yellow 
(invariant) to green (less conserved)). 
(Supplementary Fig. 5). D-box mimic (C terminus 
of Mad3) is shown in purple. b, D-box mimic 
bound through a peptide-in-channel mechanism 
to Cdc20. Lys 92 of Mad3 contacts the negatively 
charged D-box Arg-binding site of Cdc20, 
sterically hindering the D-box receptor of Cdc20. 
c, Negative electrostatic potential of Cdc20 in the 
vicinity of the Arg-binding site of modelled D box. 
d, Model of a KEN-D peptide bound to KEN and 
D-box sites of Cdc20. e, Mutation of the D-box- 
binding site abolishes S. cerevisiae APC/C“"!- 
mediated ubiquitination of securin as indicated by 
an IVT-based ubiquitination assay. D-box Arg- 
binding-site mutants: D256S, D536S, E5378 (S. 
pombe Cdc20: D173, D457, E458). Mutated S. 
— cerevisiae Cdh1 residues are indicated in black with 
r equivalent residues of S. pombe in red. At the D-box 
Leu-binding site, mutation of S. pombe Cdc20 Val 
196 had the most notable effect, with little loss of 
activity for Leu460Met. Mutation of Arg 170, 
located distal to the Arg and Leu sites (Fig. 4c), did 
not affect APC/CC*"! activity. f, Ubiquitination 
competition assay. The KEN-D box peptide 
inhibits S. cerevisiae APC/C*"!-mediated securin 
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equivalents of the Cdc20 residues Asn 326, Thr 368, Gln 392 and 
Arg 438 in Saccharomyces cerevisiae Cdh1 abolished APC/C“"!-cat- 
alysed ubiquitination of the KEN-box and D-box-dependent APC/C 
substrates securin and Clb2 (Fig. 3c and Supplementary Fig. 6). Co- 
migration of APC/C and Cdh1 on a native gel confirmed that the 
inactivation was not due to misfolding of the mutant co-activator 
(Supplementary Fig. 7). Our findings that the KEN box of Mad3 is 
embedded within a segment of ordered structure, possibly stabilized 
on the Mad3 HLH motif through contacts to Mad2 (Fig. 1), suggest 
that degrons do not need to be disordered to allow substrate recog- 
nition by ubiquitin ligases. 

The WD40 domain of co-activators interacts with the D-box 
degron (RXXLXX(I/L)(S/T)N)?*””, acting as a co-receptor with the 
APC/C subunit Apc10 (refs 40, 41). However, the molecular details of 
co-activator-D box interactions have not been established. In the 
crystal structure of the MCC, an unexpected crystal packing contact, 
in which the C terminus of a neighbouring Mad3 subunit partially 
mimics a D box and contacts Cdc20, provides detailed molecular 
insights of co-activator-D box recognition. The D box mimic of 
Mad3 was found in an extended conformation engaged within a 
conserved channel located between blades 1 and 7 on the rim of the 
B-propeller toroid (Fig. 4b and Supplementary Fig. 8). The interaction 
is dominated by the burial of the aliphatic side chain of Leu 215 of 
Mad3, mimicking the Leu of the D-box RXXL motif, within a deep 
pocket perfectly matching a Leu side chain, and created by non-polar 
residues invariant in Cdc20 and Cdh1 (Fig. 4a and Supplementary 
Fig. 5). Other consensus residues of the D box—the essential and 
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invariant Arg and less-well-conserved (I/L)(S/T)N motif*’—are not 
represented in the Mad3 D-box mimic. However, a candidate recog- 
nition site for the Arg side chain of the D box can be assigned to a 
cluster of negatively charged residues (Asp 173, Asp 457 and Glu 458) 
located on two adjacent B-strands of blade 7 on the top side of the 
D-box-binding channel. These residues are suitably positioned to 
interact with an Arg residue that is N-terminal to the anchored Leu 
residue, and are conserved in Cdh1 (Fig. 4b, cand Supplementary Fig. 5). 

To test whether the putative D-box-binding site that was identified 
on Cdc20 is responsible for recognizing the D box of APC/C substrates, 
we generated mutants to abolish individually the equivalent Arg- and 
Leu-binding sites in Cdh1. Disrupting the Leu-binding pocket by repla- 
cing Val 196 with a bulky methionine, and substituting Ser for the Cdh1 
equivalents of Asp 173, Asp 457 and Glu 458 at the putative Arg- 
binding site, eliminated the ability of Cdh1 to stimulate APC/C activity 
(Fig. 4e and Supplementary Fig. 6) but had no affect on co-activator 
binding to the APC/C (Supplementary Fig. 7). These results are con- 
sistent with the inter-blade channel of Cdc20 functioning as the D-box- 
binding site. The region of this channel that probably contacts the 
D-box (I/L)(S/T)N motif is less well conserved between Cdc20 and 
Cdh1 (Fig. 4a), possibly explaining the differential affinity of the two 
co-activators for D-box degrons. 


Substrate and inhibitor recognition by co-activators 
To characterize further the APC/C degron-binding sites on co- 


activators, we designed peptides that incorporate both KEN-box 
and D-box motifs and tested their ability to inhibit APC/C°*"'. 


©2012 Macmillan Publishers Limited. All rights reserved 


Examination of the Cdc20 structure suggested that a peptide with 
KEN and D boxes linked by 17 residues (KEN-D peptide) would 
enable cooperative binding of both degrons to their respective binding 
sites, thereby conferring higher-affinity binding than peptides with 
either degron alone (Fig. 4d). In contrast, a peptide with the reverse 
orientation of KEN and D boxes (D-KEN peptide), also connected by 
a 17-residue linker, would only permit a single degron to bind. When 
we compared the potential of these peptides to competitively inhibit 
securin ubiquitination by APC/C“"!, we found that the KEN-D 
peptide was five times more potent as an inhibitor than the D-KEN 
peptide (Fig. 4f). Because D-KEN peptide inhibited APC/C“" with 
the same efficiency as a D-box peptide (Supplementary Fig. 9a), these 
results indicate that cooperative degron binding to Cdh1 was con- 
ferred by the specific spatial arrangement of the KEN and D boxes of 
the KEN-D peptide, in agreement with our assignment of the KEN- 
and D-box-binding sites on co-activators. 

The spatial arrangement of KEN and D boxes in the KEN-D 
inhibitory peptide is markedly similar to the organization of KEN 
and D boxes in two APC/C inhibitors: Acm1 and Mes] (refs 43-45) 
(Supplementary Fig. 10). Both proteins inhibit the APC/C co-activators 
through a pseudosubstrate-based mechanism that is dependent on 
their D and KEN boxes, and it seems reasonable to assume that the 
spacing of 18 and 24 residues between the KEN and D box degrons of 
Acm1 and Mes1, respectively, optimizes inhibitor-co-activator affinity. 
Here we show one mechanism by which a KEN- and D-box-containing 
protein would bind co-activators. However, because of the diverse 
configuration and relative separation of KEN- and D-box motifs in 
APC/C substrates, several modes of co-activator-substrate recognition 
probably exist. Our assignment of the D-box recognition site to an 
inter-blade channel of the co-activator B-propeller was recently con- 
firmed by a Cdhl-Acm1 crystal structure (W.C.H.C., D.B. and J. He, 
unpublished observations). 

KEN- and D-box motifs adopt different conformations when 
bound to Cdc20; the KEN box is an underwound helix (Fig. 3b), 
whereas the D box assumes an extended structure (Fig. 4b). This 
important distinction means that the three residues of the KEN motif 
are presented to the same surface of Cdc20, whereas in the D box 
alternative amino acid side chains are oriented in opposite directions. 
Thus, with the Leu side chain of the D box anchored by the co- 
activator, conserved Arg, Ile/Leu and Asn side chains at positions 1, 
7 and 9 of the D box”, respectively, would be accessible to generate a 
composite D-box-co-activator recognition surface for the D-box co- 
receptor Apcl0 (refs 40, 41). This is consistent with the identification 
of the D-box co-receptor at the interface of co-activator and Apc10 in 
an APC/C“""P °° ternary complex". 


Implications for MCC-mediated inhibition of the APC/C 


The processes underlying MCC-mediated inhibition of APC/C are 
incompletely defined, and probably involve several mechanisms. 
Mad3, which is dependent on its N-terminal KEN box, blocks 
Cdc20-mediated substrate recognition”’, consistent with our structure 
showing that the Mad3 KEN box binds to the KEN-box recognition site 
of Cdc20. Notably, the same KEN box also promotes Mad3-dependent 
APC/C-mediated degradation of Cdc20 (refs 18, 27, 29), which sug- 
gests that Mad3 has a role in positioning Cdc20 for ubiquitination by 
the APC/C’s catalytic centre. To understand this function of Mad3, 
we docked our MCC coordinates into the electron-microscope- 
derived molecular envelope of the APC/C-MCC complex" (Fig. 5). 
Interpretation of this structure was based on our previous sub- 
unit assignment and pseudo-atomic model of budding yeast APC/ 
COMED Dox (ref. 46). The MCC crystal structure corresponds closely 
to the assigned MCC density of APC/CM°, with Cdc20, Mad2 and 
Mad3 clearly recognizable. Mad2 contacts the TPR subunits Cdc23 
and Apc5, whereas Mad3 interacts with Apcl. There is insufficient 
unassigned density to account for the C-terminal kinase domain of 
BubRI1 and Bub3, suggesting their structural disorder. Notably, the 
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Figure 5 | Pseudo-atomic structure of human APC/C™. a, Docking of 
MCC coordinates into a human APC/C™ electron microscope map". APC/ 
C subunit coordinates are based on the pseudo-atomic structure of S. cerevisiae 
APC/C*. In the context of the APC/C™“S, Cdc20 is in a downwards position 
relative to APC/C“"" (ref. 41). b, Details of MCC interactions with APC/C 
showing the position of the D-box co-receptor on Cdc20 relative to Apc10 and 
the catalytic centre of Apc2 and Apcll. The downwards position of Cdc20 
prevents interaction with Apc10. Surface representation of Cdc20 is colour- 
coded from yellow (invariant) to green (less conserved). 


KEN-box-binding site of Cdc20 is blocked by Mad3, whereas the 
D-box site is directed towards, but not in contact with, its co-receptor 
Apcl0 (Fig. 5b). Compared with APC/C“*"!P >, in the APC/CM 
complex, Cdc20 is displaced downwards towards Apc5. This position 
may facilitate Cdc20 ubiquitination. Furthermore, the lower position of 
co-activator prevents its D-box-binding site from generating a bipartite 
D-box co-receptor with Apcl0 (ref. 41). This, together with a partial 
steric blockade of the arginine site of the D-box co-receptor of Cdc20 
by Lys 92 of the Mad3 TPR domain (Fig. 4b), explains the inability of 
APC/CM® to recognize and ubiquitinate D-box-dependent sub- 
strates". In APC/CMS, a region of the Cdc20 D-box-binding site for 
the Leu anchor and C-terminal residues, is accessible (Fig. 5b), suggest- 
ing that a non-consensus D-box sequence (that is, lacking an Arg residue 
and not dependent on the co-receptor Apc10) could engage this site. 

In addition to the interactions of Mad2 and Mad3 with APC/C 
subunits, the position and activity of Cdc20 in the context of APC/ 
CMC© might also be influenced by the sequestration of its N-terminal 
APCIC recognition motifs by Mad2. The Mad2-binding motif of 
Cdc20 also mediates APC/C interactions*’, which would be blocked 
by the Mad2 safety belt (Fig. 5b). Furthermore, constraining the 
Cdc20 N terminus by Mad2 might also prevent the neighbouring C 
box from accessing its APC/C-binding site, which is necessary for 
Cdc20-dependent stimulation of APC/C catalytic activity®’. 

The SAC is antagonized by p31°°™ (refs 30,49), and the structure 
of the C-Mad2-p31°°™* complex showed that p31°°™“ binds at the 
Mad2 dimerization interface to inhibit the conformational activation 
of O-Mad2 (ref. 31). However, it is clear from our MCC structure that 
because Mad3 and p31°°™* bind to a common Mad2 interface, 
p31°°™* would compete for Mad3 interactions with Mad2 (Fig. 2c), 
explaining how p31" both antagonizes the assembly of the MCC”! 
and promotes its disassembly**”°. 

Here we show how the molecular basis for APC/C inhibition by its 
regulators is coupled to degron recognition by co-activators. APC/C 
activity is modulated by sterically blocking substrate recognition and 
through conformational changes that disrupt the substrate-binding 
site, reminiscent of modes of protein kinase regulation. This contrasts 
with the SCF complex whose activity is regulated at the level of sub- 
strate recognition through degron phosphorylation. 


METHODS SUMMARY 


Expression, purification and crystallization of S. pombe MCC. The S. pombe 
MCC (involving Cdc20, Mad2 and Mad3) was generated by co-expressing 
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Cdc20-Slp1 (residues 87-488), Mad2 (with Leu12Ala and Arg133Ala mutations) 
and Mad3 (residues 1-223) in the baculovirus and insect cell systems. 
Crystallization, crystal structure determination, mutagenesis, enzyme assays 
and other procedures were performed as described in the Methods. Diffraction 
data were collected at beamline 102 at the Diamond Light Source. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Cloning, expression and _ purification of the MCC _ complex. 
Schizosaccharomyces pombe CDC20-SLP1 (residues 87-488), MAD2 (residues 
1-203) and MAD3 (residues 1-223) genes were amplified by polymerase chain 
reaction (PCR) using an S. pombe complementary DNA library pTN-TH7 (a gift 
from T. Nakamura) as a template and cloned into a modified pFBDM vector*! 
(Z.Z. & D.B., unpublished observations). A double Strep-tag II (ds) together with 
a tobacco etch virus (TEV) cleavage site were introduced into the N terminus of 
Cdc20-Slp1 and mutations of Leul2Ala and Arg133Ala were introduced into 
Mad? (ref. 31). A previous study showed that the Arg133Ala mutant was fully 
capable of inducing a mitotic arrest*’. The resultant protein expression cassettes 
were recombined with the DH10MultiBac cells to create a bacmid by transposi- 
tion®'. The ds-Cdc20-Mad2-Mad3 complex was expressed using the baculovirus 
and insect cell (High 5 cells) systems, and purified by a combination of Strep- 
Tactin (Qiagen), anion exchange chromatography Resource Q and Superdex 200 
size-exclusion chromatography (GE Healthcare). The identities of the purified 
complex subunits were confirmed by antibody labelling and N-terminal sequen- 
cing. Mutagenesis was performed to generate the Mad2-Mad3 interface mutant, 
Cdc20-Mad2(A137Y)-Mad3(M13R). 

Crystallization, data collection and structure determination. Crystals were 
obtained in a hanging-drop fashion by pre-incubating 1:1 v/v of 4.5 mg ml! 
of protein with the crystallization solution, 100 mM Tris-HCl pH 8.8, 21% PEG 
3350, 30% ethylene glycol, 5 mM dithiothreitol (DTT), and 5 mM EDTA at 20 °C 
for 24 h, followed by streak seeding. Crystals grew to full size after 2 weeks and 
were mounted in 0.2-0.3 mm cryoloops and frozen in liquid nitrogen. Native 
crystals diffracted to a minimum Bragg spacing (din) of about 2.1 A. The dif- 
fraction data set was collected at the 102 beamline of Diamond Light Source from 
a single crystal, processed with XDS and scaled to 2.3 A with SCALA®. Phase 
information was obtained by molecular replacement with AMoRe™. Monomeric 
coordinates from the crystal structures of human MAD2(L13A) (PDB 2VFX)”*, 
human BUBRI1 (PDB 2WVI)* and human WDRS5 (PDB 3EMH)”* were used as 
search models. All three trimeric complexes (MCC complexes) that are present in 
the asymmetric unit were assembled with repeated runs of AMoRe. Iterative 
model building and refinements were carried out with COOT” and PHENIX™, 
respectively. Simulated annealing composite omit maps were systematically cal- 
culated at several stages of model building and refinement, and examined to 
minimize the effects of model bias. TLS parameters that were generated from 
the TLS motion determination server (http://skuld.bmsc.washington.edu/ 
~tlsmd/) were used throughout the refinement. Water molecules were added 
towards the end of the refinement. The structure was validated with MolProbity”. 
Data collection and refinement statistics are given in Supplementary Table 1. 
APC/C ubiquitination assays with wild-type and mutant Cdh1. Saccharomyces 
cerevisiae Cdh1 mutants were generated using PCR-based mutagenesis and 
cloned into a linearized pRSET vector. A functional T7 promoter and the Cdh1 
sequences were confirmed by DNA sequencing. APC/C ubiquitination assays 
were adopted and modified from ref. 61. *°S-labelled Clb2p and securin 
(Pdslp) and unlabelled Cdh1 mutants were prepared using TNT T7 Quick- 
coupled in vitro transcription (or translation) (IVT; Promega). Each ubiquitina- 
tion reaction contains approximately 10 ng of S. cerevisiae APC/C, 1 ul of 
*°S-labelled substrate and 2 ul of Cdh1 in a 10-1 reaction volume with 40 mM 
Tris-HCl pH 7.5, 10 mM MgCl, 0.6 mM DTT, 2.7 mM ATP, 6.6 jig of methyl 
ubiquitin, 500 ng of Ubc4, 200 ng of ubiquitin aldehyde (Enzo Life Science), 2 1M 
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LLnL (N-acetyl-Leu-Leu-Norleu-aldehyde) (Sigma). Reactions were incubated at 
room temperature for 15 min and were analysed using 8% SDS-PAGE. Gels were 
fixed and stained with Coomassie blue, then dried and exposed to BioMax MR 
Film (Kodak). 

Native gel electrophoresis. Correct folding of S. cerevisiae Cdh1 mutants was 
assessed by their co-migration with the APC/C in native gel electrophoresis. 50 ng 
of apoAPC/C was mixed with 2 1l of *°S-labelled IVT-produced Cdh1 and 0.7 tl 
of 100 mM CaCl, in a volume of 14 ul with 10 mM Tris pH 8.0, 150 mM NaCl, 3 
mM DTT, 1 mM magnesium acetate and 2 mM EGTA. Samples were incubated 
at room temperature for 15 min before adding 1 pl of native gel loading buffer 
(125 mM Tris pH 8.0, 84% (v/v) glycerol) to each reaction. The entire reaction 
was loaded onto a 5.25% non-denaturing polyacrylamide gel run at 4 °C, 110 V 
for 2 h. Gels were fixed and stained with Coomassie blue, then dried and exposed 
to film. 

KEN-D peptide competition assays. The KEN-D (Ac-NKENEGPASGASGA 
SGASGAQRAALSDITNS-NH2), D-KEN (Ac-QRAALSDITNSGASGASGASG 
ASNKENEGPA-NH2), KEN-D™" (Ac-NKENEGPASGASGASGASGAQSAA 
ASDITSS-NH2) and KEN™"-D (Ac-NSASEGPASGASGASGASGAQRAALSD 
ITNS-NH2) peptides were designed with a 17-residue linker between the KEN- 
box sequence (NKENEGPA) and the D-box sequence (QRAALSDITNS), or 
between their respective mutant sequences (NSASEGPA and QSAAASDITSS). 
The peptides were synthesized by Cambridge Peptides. APC/C ubiquitination 
reactions were performed in the same way as described above using the co- 
purified APC/C“"' complex“. Peptides were added into the reaction to a final 
concentration ranging from 5 LM to 1000 LM. Reactions were incubated at room 
temperature for 15 min and were analysed using 8% SDS-PAGE. Gels were fixed 
and stained with Coomassie blue, then dried and exposed to film. 

Fitting structure coordinates into the human APC/C™“ EM map. The struc- 
ture coordinates of S. pombe Cdc16-Cdc26 complex (PDB 2XPI), Cdc27 (PDB 
3KAE), Apcl0 (PDB 1GQP), modelled Apc2 and MCC were fitted into the 
human APC/CM®S map (EMD-1591) based on the S. cerevisiae APC/C assign- 
ment**, using UCSF Chimera. 
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In pregnancy, trophoblast invasion and uterine spiral artery remodel- 
ling are important for lowering maternal vascular resistance and 
increasing uteroplacental blood flow. Impaired spiral artery remodel- 
ling has been implicated in pre-eclampsia, a major complication of 
pregnancy, for a long time but the underlying mechanisms remain 
unclear’’. Corin (also known as atrial natriuretic peptide-converting 
enzyme) is a cardiac protease that activates atrial natriuretic peptide 
(ANP), a cardiac hormone that is important in regulating blood pres- 
sure’. Unexpectedly, corin expression was detected in the pregnant 
uterus*. Here we identify a new function of corin and ANP in pro- 
moting trophoblast invasion and spiral artery remodelling. We show 
that pregnant corin- or ANP-deficient mice developed high blood 
pressure and proteinuria, characteristics of pre-eclampsia. In these 
mice, trophoblast invasion and uterine spiral artery remodelling were 
markedly impaired. Consistent with this, the ANP potently stimu- 
lated human trophoblasts in invading Matrigels. In patients with pre- 
eclampsia, uterine Corin messenger RNA and protein levels were 
significantly lower than that in normal pregnancies. Moreover, we 
have identified Corin gene mutations in pre-eclamptic patients, which 
decreased corin activity in processing pro-ANP. These results indicate 
that corin and ANP are essential for physiological changes at the 
maternal-fetal interface, suggesting that defects in corin and ANP 
function may contribute to pre-eclampsia. 

Pregnancy poses a serious challenge for maintaining normal blood 
pressure. Pregnancy-induced hypertension, a major cause of maternal 
and fetal deaths, occurs in approximately 10% of pregnancies*®. 
During pregnancy, the uterus undergoes profound morphological 
changes, including trophoblast invasion and spiral artery remodelling. 
In pre-eclampsia, impaired spiral artery remodelling is common, but 
the underlying mechanisms are unclear'*’°. Studies indicate that 
vascular growth factor receptors, angiotensin and oestradiol are 
involved in the disease’?™*. 

Corin is a cardiac protease that activates ANP, which is a cardiac 
hormone that regulates blood pressure and sodium homeostasis’*. In 
mice, lack of CORIN prevents ANP generation and causes hyper- 
tension’®. In humans, CORIN variants are associated with hyper- 
tension’’. Interestingly, Corin expression was detected in the pregnant 
mouse* (Fig. 1A) and human uterus (Supplementary Fig. 1). As a trans- 
membrane protein, CORIN is expected to act at the expression sites, 
suggesting a possible function in the pregnant uterus. 

To understand the role of CORIN in pregnancy, we created a mouse 
model in which a Corin transgene was expressed under a cardiac 
promoter (Fig. 1B). The transgenic and Corin knockout mice were 
crossed to generate mice expressing Corin only in the heart (“knock- 
out/transgenic mice’; Fig. 1C, D). In knockout/transgenic mice, trans- 
genic Corin expression restored pro-ANP processing in the heart 
(Supplementary Fig. 2) and normalized blood pressure (Fig. 1E), 


indicating that cardiac CORIN was sufficient to maintain normal 
blood pressure in non-pregnant mice. 

In pregnant Corin knockout mice, blood pressure increased at 
approximately 17 days post coitus and rose further before returning 
to the non-pregnant blood pressure level after delivery (Fig. 1F), which 
resembled late gestational hypertension in pre-eclamptic women. In 
Corin knockout/transgenic mice, which were normotensive, blood 
pressure increased similarly during pregnancy (Fig. 1G), indicating 
that cardiac Corin expression did not prevent pregnancy-induced 
hypertension. The data also show that in these mice, hypertension in 
pregnancy was not due to pre-existing high blood pressure. As well as 
in the uterus, Corin mRNA was detected in the umbilical cord and 
placenta (Supplementary Fig. 3). To distinguish the role of maternal 
Corin from that of placental or other fetal organs, Corin knockout 
females were mated with either wild-type or knockout males. The 
resulting fetuses carried one or no copy of the functional Corin gene. 
Normally, enzymes that are encoded by one gene copy are able to 
function. As shown in Fig. 1H, pregnant Corin knockout females that 
were mated with either wild-type or knockout males had similarly 
increased blood pressure, indicating that lack of maternal, but not fetal, 
Corin caused hypertension in pregnancy. 

Proteinuria is a hallmark of pre-eclampsia. Wild-type, Corin knock- 
out and knockout/transgenic mice had similar urinary protein levels 
before pregnancy and at mid gestation. However, the levels increased 
in Corin knockout and knockout/transgenic mice at late gestation 
(Fig. 11), consistent with reported proteinuria in mouse models of 
pre-eclampsia’®. Ischaemic glomeruli, indicated by fewer red blood 
cells, were found in pregnant Corin knockout and knockout/transgenic 
mice (Fig. 1J, a-f) but not in non-pregnant mice (Supplementary Fig. 4). 
Periodic acid—Schiff staining revealed increased extracellular matrixes 
and collapsed glomerular capillaries in pregnant Corin knockout and 
knockout/transgenic mice (Fig. 1J, g—i). Electron microscopy showed 
narrow glomerular capillary lumens and thick basement membranes 
(Fig. 1K), suggesting endotheliosis and increased extracellular mat- 
rices. Additional pathological features such as necrotic cells and cal- 
cium deposits in the placental labyrinth also existed in these mice 
(Supplementary Fig. 5), indicating insufficient uteroplacental per- 
fusion. Consistent with this, Corin knockout and knockout/transgenic 
mice had smaller litters (7.1 + 2.3 (n = 28) and 6.8 + 2.7 (m = 28) pups 
per litter, respectively, versus wild-type mice, which had 9.1 + 1.2 
(n = 21) pups per litter; P< 0.001 in both cases). 

We examined embryos at embryonic day 12.5 (E12.5), an early time 
point before blood pressure increase in Corin knockout and knockout/ 
transgenic mice, and E18.5 (two days before delivery). Wild-type E12.5 
embryos showed obvious trophoblast invasion, shown by cytokeratin 
staining (Fig. 2a), and large vessels mostly in the deep decidua, shown 
by smooth-muscle o-actin (SMA) staining (Fig. 2b), indicating that 
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Figure 1 | Hypertension, proteinuria and renal pathology in pregnant 
Corin knockout and knockout/transgenic mice. A, Corin mRNA expression 
in mouse uteruses. B, Corin transgenic (Tg) construct. C, D, Western blot 
analysis of CORIN protein in wild-type (WT), Corin knockout (KO) and 
knockout/transgenic mice. E, Blood pressure (BP, mean + s.d.) in non- 
pregnant females. F, G, Blood pressure increased in Corin knockout (F) and 
knockout/transgenic (G) mice in pregnancy. Data are mean + s.d. *P < 0.05 
versus WT of the same time point. P < 0.05 versus non-pregnant level of the 
same genotype. H, Similar blood-pressure changes in Corin knockout females 
mated with knockout or WT males. I, Late gestational proteinuria in Corin 


smooth muscles in the superficial decidua were replaced by invading 
trophoblasts. In contrast, trophoblast invasion in Corin knockout and 
knockout/transgenic embryos was markedly reduced (Fig. 2a) and 
smaller arteries were found in both superficial and deep decidua 
(Fig. 2b). In E18.5 wild-type embryos, more abundant trophoblasts 
were found in the decidua and myometrium compared with those in 
Corin knockout and knockout/transgenic mice (Fig. 2c, d). By haema- 
toxylin and eosin staining, larger and more abundant decidual spiral 
arteries were observed in wild-type than in Corin knockout or knock- 
out/transgenic mice (Fig. 2e). Figure 2f-h shows strong cytokeratin 
(trophoblasts) staining but weak von Willebrand factor (endothelial) 
and SMA (smooth muscle) staining in wild-type decidual and myo- 
metrial arteries. These data indicate that trophoblast invasion and 
spiral artery remodelling were impaired in Corin knockout and knock- 
out/transgenic mice, and that this defect occurred before blood pres- 
sure increased in these mice. 

CORIN activates ANP in the heart’? but it was unknown whether the 
CORIN function in pregnancy was also mediated by ANP. Pro-ANP is 
expressed in the non-pregnant and pregnant uterus (Supplementary 
Fig. 6). If CORIN acts on pro-ANP to promote trophoblast invasion 
and spiral artery remodelling, thereby preventing hypertension in preg- 
nancy, ANP (also known as Nppa) and Corin knockout mice should 
have similar phenotypes. ANP knockout mice are hypertensive (Fig. 3a) 
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knockout and knockout/transgenic mice. Data are mean + s.d. **P < 0.01,n = 
7 or 8 per group. J, a-i, Renal ischaemia in pregnant Corin knockout and 
knockout/transgenic mice. E18.5 sections are stained with haematoxylin and 
eosin (H&E), Masson trichrome or periodic acid—Schiff (PAS). Scale bar, 20 
uum. Red blood cells (yellow arrows) and open capillaries (red arrows) in WT 
glomeruli are shown. K, Narrow glomerular capillary lumen (L) and thick 
basement membranes (red arrows) in Corin knockout and knockout/ 
transgenic mice at E18.5 shown by electron microscopy. GAPDH, 
glyceraldehyde 3-phosphate dehydrogenase; NP, non pregnant; pA, poly A. 


but their blood pressure was not monitored during pregnancy’’. We 
found similarly increased blood pressure in pregnant ANP knockout 
mice (Fig. 3b). The mice also had late gestational proteinuria (Fig. 3c) 
and smaller litters (4.4 + 1.7 (n = 25) versus wild-type, 9.1 + 1.2 
(n = 21) pups per litter, P < 0.001). By immunostaining, impaired 
trophoblast invasion and smaller spiral arteries were observed in 
E12.5 embryos (Fig. 3d, e). In E18.5 embryos, ANP knockout mice 
had far fewer trophoblasts (Fig. 3f, g) and smaller arteries (Fig. 3h) in 
the decidua and myometrium than those in wild-type mice. Consistent 
with this, weak cytokeratin-staining but strong von Willebrand factor- 
staining were found in arteries in ANP knockout mice (Fig. 3i). Thus, 
ANP and Corin knockout mice had very similar phenotypes, indicating 
that the role of CORIN in pregnancy is probably mediated by ANP. 
In the heart, CORIN produces ANP, which then regulates blood 
pressure by promoting natriuresis and vasodilation’. Here we found 
that lack of CORIN and ANP impaired trophoblast invasion and spiral 
artery remodelling, which was not rescued by cardiac Corin expression 
in Corin knockout/transgenic mice. ANP is known to relax vascular 
smooth muscles. Recently, ANP and its downstream cyclic GMP- 
dependent protein kinase were shown to be important in angiogenic 
processes by promoting endothelial regeneration”””'. Thus, ANP may 
function locally to remodel uterine arteries. Our results also indicate 
that ANP may directly promote trophoblast invasion (Fig. 4a), and we 
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Figure 2 | Impaired trophoblast invasion and spiral artery remodelling in 
Corin knockout and knockout/transgenic mice. a, b, E12.5 embryo sections 
were stained for trophoblasts (a) or smooth muscles (b). Fetal (F) and maternal 
(M) sides are indicated. Boxed areas in top panels are shown at a higher 
magnification (200). c, E18.5 embryo sections were stained for trophoblasts. In 
lower panels (X 100), yellow lines show the decidua and myometrium boundary. 


therefore tested this idea. We found that ANP markedly stimulated 
human trophoblasts to invade Matrigels (Fig. 4b) (Supplementary 
Fig. 7a). In these cells, ANP receptor (also known as atrial natriuretic 
peptide receptor 1) mRNA expression was confirmed (Supplementary 
Fig. 7b) and ANP-stimulated intracellular cGMP production was 
detected (Fig. 4c) (Supplementary Fig. 7c). 

Our findings emphasize the importance of local ANP production by 
CORIN, which acts on trophoblasts and vascular cells in the uterus. 
Because heart-derived ANP circulates inside the vessel, our model may 
explain why cardiac CORIN failed to promote trophoblast invasion and 
uterine artery remodelling, as shown in Corin knockout/transgenic 
mice. To verify this hypothesis, we quantified Corin mRNA and 
protein in human uteruses by polymerase chain reaction with reverse 
transcription (RT-PCR) and enzyme-linked immunosorbent assay 
(ELISA). The levels were low in non-pregnant women but increased 
in pregnant women (Fig. 4d, e). In pre-eclamptic women, the levels 
were significantly lower than in normal pregnancies. Similar results 
were found by immunostaining (Fig. 4d and Supplementary Fig. 8). 
Consistent with this, pro-ANP levels in uterine tissues were signifi- 
cantly higher in pre-eclamptic women than in normal pregnant women 
(Fig. 4f), indicating that reduced uterine Corin expression impaired 
pro-ANP processing in these patients. Corin is a membrane-bound 
protein*'*, and recent studies showed that CORIN can be shed from 
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d, Quantitative data (mean + s.d.) of cytokeratin (Cytok) staining. e, Fewer and 
smaller decidual spiral arteries (arrows) in H&E-stained E18.5 Corin knockout 
and knockout/transgenic embryos. f-h, Co-staining of SMA, von Willebrand 
factor (vWF), cytokeratin and nuclei in E18.5 embryos. Red arrows indicate 
cytokeratin (green) signals, white arrows indicate von Willebrand factor (red) 
signals and orange arrows indicate mixed (yellow) signals. 


cardiomyocytes and that soluble CORIN is found in human plasma”. 


We found that plasma CORIN levels were higher in pre-eclamptic 
patients than non-pregnant or normal pregnant women (Fig. 4g). 
Thus, CORIN levels in plasma did not reflect the levels in tissues, 
indicating that plasma CORIN was probably derived from the heart, 
where Corin expression increased in response to high blood volume 
and high blood pressure in pregnancy. These results provide further 
support for a local function of CORIN in the pregnant uterus. 

We next sequenced the CORIN gene” in pre-eclamptic patients and 
identified a mutation that alters Lys to Glu at position 317 in low- 
density lipoprotein receptor repeat 2 in one woman (Fig. 4h, j) and 
another mutation altering Ser to Gly at position 472 in the frizzled 2 
domain in two women from the same family who had pre-eclampsia 
(Fig. 4i, j). In functional studies, Lys317Glu and Ser472Gly mutations 
did not affect CORIN expression in HEK293 cells but markedly 
reduced CORIN activity in processing pro-ANP (Fig. 4k—n). The data 
were consistent with previous findings that Low-density lipoprotein 
receptor repeats and frizzled domains are critical for CORIN activity”’, 
suggesting that the mutations may impair CORIN function in the 
patients, thereby contributing to pre-eclampsia. Interestingly, 
CORIN variants in the frizzled 2 domain that impaired CORIN func- 
tion have been reported in African American people’””’, a high-risk 
population for pre-eclampsia. 
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Figure 3 | Hypertension, proteinuria and uteroplacental pathology in 
pregnant ANP knockout mice. a, Blood pressure (mean + s.d.) in non- 
pregnant females, **P < 0.01. b, Elevated blood pressure (mean + s.d.) in 
pregnant ANP knockout mice. {P < 0.05 versus non-pregnant level. 

c, Gestational proteinuria in ANP knockout mice. Data are mean = s.d. 

d, e, Impaired trophoblast invasion and smooth muscle remodelling in E12.5 
embryos stained for cytokeratin (d) or SMA (e). Boxed areas in top panels are 
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shown at a higher magnification (200). f, Impaired trophoblast invasion in 
E18.5 embryos stained for cytokeratin. g, Quantitative data (mean + s.d.) of 
cytokeratin staining in E18.5 ANP knockout embryos. h, Impaired decidual and 
myometrial artery remodelling (arrows) in H&E-stained E18.5 ANP knockout 
embryos. i, Co-staining of cytokeratin, von Willebrand factor and nuclei in 
E18.5 ANP knockout embryos. Red arrows indicate cytokeratin (green) signals 
and white arrows indicate von Willebrand factor (red) signals. 
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Figure 4 | ANP-stimulated human 
trophoblast invasion, and impaired 
uterine Corin expression and Corin 
mutations in pre-eclamptic 
patients. a, A model showing that 
CORIN-produced ANP in the 
pregnant uterus promotes 
trophoblast invasion (red arrows) 
and vascular-wall remodelling (black 
arrows). b, c, ANP-stimulated 
human BeWo trophoblasts and 
primary trophoblasts in Matrigel 
invasion (b) and intracellular cGMP 
production (c). Data are mean = s.d. 
*P < 0.05; **P < 0.01 versus control. 
d-f, Corin mRNA (d) and protein 
(e), and pro-ANP levels (f) in human 
uterus samples. Horizontal lines 
indicate mean values. g, Plasma- 
soluble CORIN levels (mean + s.d.) 
in pre-eclamptic patients and normal 
controls. h-j, CORIN gene mutations 
causing Lys317Glu (h) and 
Ser472Gly (i) changes in CORIN 

(j). k, 1, Expression of Lys317Glu and 
Ser472Gly mutants in HEK293 cells 
(top panels). Vector, WT CORIN 
and inactive CORIN Arg801Ala and 
Ser985Ala mutants were controls. 
Lys317Glu and Ser472Gly mutations 
reduced pro-ANP processing activity 
(bottom panels). m, n, Quantitative 
data (mean + s.d.) from three 
experiments or more. Fz, frizzled; 
LDLR, LDL receptor; SR, scavenger 
receptor; TM, transmembrane. 
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Previously, high levels of plasma pro-ANP or ANP were detected in 
pre-eclamptic patients’”**. As shown by our plasma soluble CORIN 
data, plasma protein levels may not reflect those in tissues. Taken 
together, our data show a novel local function of CORIN and ANP 
in promoting trophoblast invasion and spiral artery remodelling to 
prevent hypertension in pregnancy. The data suggest that impaired 
Corin expression or function in the pregnant uterus may be an import- 
ant mechanism underlying pre-eclampsia. Studies to better under- 
stand impaired uterine Corin expression in pre-eclamptic patients 
may help to develop new strategies to enhance the CORIN-ANP 
pathway and prevent or treat this life-threatening disease. 


METHODS SUMMARY 


Corin and ANP knockout mice have been described previously'®’. Transgenic 
mice with cardiac Corin expression were generated using a heart-specific pro- 
moter. Blood pressure was measured by radiotelemetry’’. Tissue sections from 
non-pregnant and pregnant mice were stained with haematoxylin and eosin, 
Masson’s trichrome, periodic acid—-Schiff or von Kossa, or immunostained with 
antibodies against cytokeratin, SMA, von Willebrand factor or CORIN. Renal 
sections were also examined by electron microscopy. Trans-well invasion assay 
was carried out with human primary villous trophoblasts (ScienCell) and tropho- 
blastic JEG3, BeWo, JAR cell lines (ATCC) in Matrigel Invasion Chambers (BD 
Biosciences). ANP-stimulated cGMP production in trophoblasts was assayed in 
96-well plates. Intracellular cGMP levels were determined using an enzyme 
immunoassay kit (Enzo Life Sciences). Corin levels in human blood and uterus 
tissue samples were measured using ELISA”. Pro-ANP levels in human uterus 
tissues were also measured using ELISA (Alpco Diagnostics). Corin gene exons™ 
from pre-eclamptic patients were PCR-amplified and sequenced directly. Corin 
gene mutations that were identified were studied by expressing mutant CORIN 
proteins in HEK293 cells and testing their activities in pro-ANP processing assays, 
as described previously”®. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Knockout and transgenic mice. Corin knockout mice were described previously’®. 
ANP knockout mice (B6.129P2-Nppa ”!”"*/J)'° were from the Jackson Laboratory. 
To make transgenic mice expressing Corin in the heart, the full-length mouse Corin 
cDNA was inserted into a construct driven by the mouse %-myosin heavy chain 
(a-MHC) promoter. Pro-nuclear microinjection and breeding of transgenic mice 
were carried out at the Case Western Reserve University Transgenic Core. Corin 
knockout and transgenic mice were crossed to generate knockout/transgenic mice. 
Littermates were used as controls. The animal study was conducted in accordance 
with the National Institutes of Health guidelines and approved by the Institutional 
Animal Care and Use Committee at the Cleveland Clinic. 

Blood-pressure monitoring. Radiotelemetry was used for real-time blood- 
pressure monitoring in conscious and unrestrained mice’®. Female mice (8-12 
weeks old) were chronically instrumented in the left carotid artery with a PA-C10 
device (Data Sciences International) and rested for at least 7 days to recover from 
the surgery. The mice were mated and checked for vaginal plugs to establish 
gestation timing. The day on which a plug was observed was defined as E0.5. 
The mating mice were homozygous except for those in the fetal testing experi- 
ment. Telemetry receivers (model RPC-1) were placed under individual cages for 
data acquisition using the Dataquest A.R.T. 4.0 Gold System (Data Sciences 
International). Data presented were from continuous recording of at least 6 h 
per day (10:00 to 16:00). 

Urinary protein measurement. Urine samples were collected from non-pregnant 
mice and pregnant mice at mid (8-10 days post coitus) and late (16-18 days post 
coitus) gestational stages. Urinary protein levels were measured using a colorimetric 
assay based on a modified Bradford method (Bio-Rad). 

RT-PCR, western blot analysis and ELISA. Total RNAs were isolated from 
cultured cells or mouse and human tissues using TRIzol reagent (Invitrogen) or 
an RNeasy kit (Qiagen), and were used to synthesize the first strand cDNAs. RT- 
PCR was carried out using oligonucleotide primers that were specific for the 
mouse or human CORIN, mouse ANP, or human ANP receptor genes. 
Quantitative RT-PCR for human CORIN mRNA expression in uterus tissues 
was carried out using the PRISM 7500 System (Applied Biosystems). The B-actin 
gene was used as an internal control. Quantitative RT-PCR for mouse ANP 
mRNA in uteruses was carried out using the iCycler system (Bio-Rad). For western 
blot analysis of CORIN protein, membrane fractions from tissue homogenates 
were isolated by ultracentrifugation, as described previously”. Proteins were 
analysed by SDS-polyacylamide agarose gel electrophoresis (SDS-PAGE) and 
western blot using a polyclonal antibody (Berlex Biosciences). Western blot 
analysis of pro-ANP in heart samples was carried out using a polyclonal antibody 
(Santa Cruz). Processing pro-ANP by CORIN in transfected cells was analysed by 
western blot analysis, as described previously’. Pro-ANP in human uterus tissues 
was measured by an amino-terminal (NT) pro-ANP ELISA kit from Alpco 
Diagnostics. Human CORIN in uterus tissues or plasma was measured by 
ELISA, as described previously*’. 

Histology and immunohistochemistry. Tissues were fixed with 4% para- 
formaldehyde and embedded in paraffin. Sections were stained with H&E, 
Masson’s trichrome, PAS or von Kossa. For immunohistochemical or immuno- 
fluorescent analysis, antibodies against SMA (Sigma-Aldrich), von Willebrand 
factor (Sigma-Aldrich) and cytokeratin (Dako) were used to label smooth muscle 
cells, endothelial cells and trophoblasts, respectively. For human CORIN, an 
antibody from Berlex Biosciences was used. Secondary antibodies were conjugated 
with horseradish peroxidase or Alexa Fluor 488 (green) or Alexa Fluor 594 (red) 
(Invitrogen). Tissue sections were mounted with or without DAPI-containing 
(blue) mounting medium (Dako). For ANP expression in mouse uterus tissues, 
a polyclonal antibody from Millipore was used. Control sections were treated 
similarly but without the primary antibodies. Photographs were taken with a light 
or fluorescent microscope equipped with a digital camera (Olympus). Data are 
from experiments using five or more mice per study group. 

For immunohistochemical analysis of trophoblast invasion in mouse embryos, 
tissue samples from at least five mice per group, and at least two implant sites per 
mouse were used. Serial sections (>50 per embryo) of 5 kum in thickness were 
prepared. The position of the maternal artery was used as a guide to orient section 
positions. At least 4-6 sections from the centre of the placenta of each embryo were 
used for immunohistochemical analysis. Slides that were stained for cytokeratin 
were examined by two individuals. The sections that showed the deepest tropho- 
blast invasion are presented. These sections were also analysed by ImagePro soft- 
ware to quantify cytokeratin staining. For each section that was analysed, the entire 
area of the decidua and myometrium was scanned by the software. 

Electron microscopy. Kidneys from pregnant mice at E18.5 were fixed in 3% 
glutaraldehyde, treated with 1% osmium tetroxide and embedded in an Araldite— 
Epon mixture. Semi-thin sections (0.6 jum) were prepared and examined with a 
transmission electron microscope (JEOL JEM-1210) at the Lerner Image Core of the 


Cleveland Clinic. Data are from experiments using at least three mice per study 
group. 

Trans-well invasion assay. Human trophoblastic JEG3, BeWo and JAR cells from 
the American Type Culture Collection were cultured in Minimum Essential 
Medium (JEG3), RPMI1640 (JAR) and F-12K (BeWo) medium, respectively, with 
10% FBS at 37 °C. Primary human villous trophoblasts from ScienCell Research 
Labs (Carlsbad) were cultured in the Trophoblast Medium (ScienCell) with 10% 
EBS. Transwell invasion assays were carried out using the BioCoat Growth Factor 
Reduced Matrigel Invasion Chambers (pore size of 8 tm) and control inserts (no 
Matrigel coating) (BD Biosciences) in 24-well plates. Culture medium containing 
human ANP (Calbiochem) was added to the bottom wells, and cell suspension 
(5 X 10*) was added to the top wells and incubated at 37 °C for 24 h. Non-invading 
cells were removed from the upper surface of the Matrigel layer by gentle 
scrubbing. The cells on the lower surface of the membrane were stained using 
Diff-Quick staining solutions. The membranes were excised and mounted onto 
glass slides. Invasion indices were determined by counting the number of stained 
cells on the membrane under a light microscope. The assay was carried out in 
duplicate in at least three independent experiments. 

cGMP assay. ANP-stimulated intracellular cGMP production assay was per- 
formed with JEG3, JAR and BeWo cells and primary human trophoblasts using 
a method described previously’. The cells were grown in 96-well plates. Confluent 
cells were washed once with serum-free medium. Human ANP was added to 
serum-free medium and incubated with cells at 37 °C for 30 min. In these experi- 
ments, ANP was more potent in stimulating intracellular cGMP production when 
serum-free medium was used (data not shown). The cells were lysed with 0.1 M 
HCL. Intracellular cGMP levels in ANP-stimulated cells were determined using an 
EIA kit (Enzo Life Sciences). Each experimental condition was assayed in duplicate 
in at least three independent experiments. 

Human blood and tissue samples. The study was approved by local ethics 
committees and participants gave informed consent. Women of normal preg- 
nancy or with pre-eclampsia, and age-matched non-pregnant normal controls 
who underwent routine medical check-ups were recruited. All participants were 
ethnic Han Chinese. Hypertension was defined as diastolic pressure >90 mm Hg 
and/or systolic pressure >140 mm Hg on at least two occasions. Pre-eclampsia 
was defined as hypertension that appeared after 20 weeks of gestation with 
proteinuria (>300 mg urinary protein per 24 h). Patients with chronic hyper- 
tension, chronic kidney disease, diabetes and heart disease were excluded. Uterus 
tissues were obtained during caesarean sections in pregnant women or operations 
for uterine leiomyoma in non-pregnant women. Clinical characteristics of women 
who provided blood and those who provided uterus tissue samples are summarized 
in Supplementary Tables 1 and 2, respectively. 

CORIN gene sequences in patients. Blood samples from 56 patients with pre- 
eclampsia were collected into tubes containing EDTA as an anticoagulant. 
Genomic DNA was extracted from white blood cells using the QlAamp DNA 
Mini kit (Qiagen) and used in PCR to amplify exon sequences of the CORIN 
gene”. PCR products were used for direct DNA sequencing. Mutations that were 
identified were verified by independent PCR and DNA sequencing. Additional 
PCR and DNA sequencing were carried out with DNA samples from more than 
100 normal controls to verify that mutations that were identified in patients did 
not exist in the normal population. 

Expression and functional analysis of Corin mutants. Plasmids expressing 
human wild-type Corin and two inactive mutants Arg801 Ala and Ser985Ala, in which 
the activation cleavage site and catalytic site residues were mutated, respectively, were 
described previously*’. Plasmids expressing Corin mutants Lys317Glu or Ser472Gly 
were constructed by PCR-based mutagenesis. Recombinant CORIN proteins that 
were expressed by these plasmids contained a carboxy-terminal V5 tag to be detected 
by an anti-V5 antibody (Invitrogen)’°. Plasmids were transfected into HEK293 cells 
using Lipofectamine 2000 (Invitrogen). Cells were lysed and proteins were analysed 
by western blot using an anti-V5 antibody. To analyse the function of CORIN, 
recombinant human pro-ANP in conditioned medium was added to HEK293 cells 
expressing Corin wild-type or mutants and incubated at 37 °C for 2 h. Pro-ANP and 
ANP in the medium were immunoprecipitated and analysed by western blot. Protein 
bands on X-ray films were scanned by densitometry. The percentage of pro-ANP to 
ANP conversion was calculated as described previously”°. 

Statistical analysis. Results are presented as mean + s.d. Differences between two 
groups were analysed with the Student’s t-test. Data involving more than two 
groups were analysed by analysis of variance followed by the Tukey multiple 
comparison test. Comparisons for Corin mRNA and protein and pro-ANP levels 
in human uterus samples were carried out using the Mann-Whitney-Wilcoxon 
test. A P value of less than 0.05 was considered statistically significant. 
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Spontaneous coherence in a cold exciton gas 


A. A. High’, J. R. Leonard’, A. T. Hammack', M. M. Fogler’, L. V. Butov'!, A. V. Kavokin?, K. L. Campman’* & A. C. Gossard* 


If bosonic particles are cooled down below the temperature of 
quantum degeneracy, they can spontaneously form a coherent state 
in which individual matter waves synchronize and combine. 
Spontaneous coherence of matter waves forms the basis of a number 
of fundamental phenomena in physics, including superconductivity, 
superfluidity and Bose-Einstein condensation’’. Spontaneous 
coherence is the key characteristic of condensation in momentum 
space’. Excitons—bound pairs of electrons and holes—form a model 
system to explore the quantum physics of cold bosons in solids*”. 
Cold exciton gases can be realized in a system of indirect excitons, 
which can cool down below the temperature of quantum degeneracy 
owing to their long lifetimes’. Here we report measurements of 
spontaneous coherence in a gas of indirect excitons. We found that 
spontaneous coherence of excitons emerges in the region of the 
macroscopically ordered exciton state’ and in the region of vortices 
of linear polarization. The coherence length in these regions is much 
larger than in a classical gas, indicating a coherent state with a much 
narrower than classical exciton distribution in momentum space, 
characteristic of a condensate. A pattern of extended spontaneous 
coherence is correlated with a pattern of spontaneous polarization, 
revealing the properties of a multicomponent coherent state. We 
also observed phase singularities in the coherent exciton gas. All 
these phenomena emerge when the exciton gas is cooled below a 
few kelvin. 

There are intriguing theoretical predictions for a range of coherent 
states in cold exciton systems, including the Bose-Einstein condensate’, 
a BCS-like condensate’, the charge-density wave®, and a condensate 
with spontaneous time-reversal symmetry breaking’. Because excitons 
are much lighter than atoms, quantum degeneracy can be achieved in 
excitonic systems at temperatures orders of magnitude higher than the 
microkelvin temperatures needed in atomic vapours’. Exciton gases 
need be cooled down to a few kelvin to enter the quantum regime. 
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Figure 1 | Emission, interference and coherence patterns of indirect 
excitons. a, Diagram of CQW structure: n* -GaAs (blue), Alp 33Gag.¢7As 
(grey), GaAs quantum well (yellow). V, is applied voltage. Ellipses indicate 
indirect excitons composed of electrons (—) and holes (+). b, CQW band 
diagram; e, electron; h, hole; E, energy. The arrow indicates an indirect exciton. 
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Although the temperature of the semiconductor crystal lattice (Tj) 
can be lowered well below 1K in helium refrigerators, lowering the 
temperature of the exciton gas (Tx) to even a few kelvin is challenging’®"’. 
Owing to recombination, excitons have a finite lifetime that is too short 
to allow cooling to low temperatures in usual semiconductors. In order 
to create a cold exciton gas with Ty close to Tj, the exciton lifetime 
should considerably exceed the exciton cooling time. As well as this, the 
realization of a cold and dense exciton gas requires an excitonic state to 
be the ground state and to have lower energy than the electron-hole 
liquid’. 

A gas of indirect excitons fulfils these requirements. An indirect 
exciton can be formed by an electron and a hole confined in separate 
quantum-well layers (Fig. 1a, b). The spatial separation allows the overlap 
of electron and hole wavefunctions to be controlled. In this way, indirect 
excitons can be produced with radiative lifetimes and spin relaxation 
times orders of magnitude longer than those of direct excitons'*""’. 

In earlier studies, evidence for spontaneous coherence was obtained 
for indirect excitons in coupled quantum wells (CQWs)"° and for indirect 
excitons in quantum Hall bilayers'”’*. The onset of spontaneous 
coherence was evidenced by a strong enhancement of the rates of 
recombination’® and tunnelling’, respectively. The results of other 
transport and optical experiments were also consistent with spontan- 
eous coherence of indirect excitons®'* ”. However, no direct measure- 
ment of coherence was performed in these studies. 

Exciton coherence is evidenced by coherence of their light emission, 
which can be studied by interferometry. In our earlier work, we 
reported an enhancement of the exciton coherence length in the 
macroscopically ordered exciton state’*** (MOES). However, these 
experiments used a single-pinhole interferometric technique, which 
does not measure the coherence function, and the derivation of the 
exciton coherence length in refs 23 and 24 was based on a mathematical 
analysis of the data. 
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c, Diagram of the interferometric set-up. d, Emission pattern (luminescence). 
e, Interference pattern Tinterr(x, y) for 6x = 2 tm. f, Pattern of the amplitude of 
the interference fringes Ajnterr(x, y), presenting a map of coherence. The 
temperature in the refrigerator at the sample is Tyath = 0.1 K. 
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Here we report the direct measurement of spontaneous coherence in 
a gas of indirect excitons in CQWs. These indirect excitons may have 
four spin projections on the z direction normal to the CQW plane: 
J,= —2,—-1, +1, +2. The states J, = —1and +1 contribute to left- and 
right-circularly polarized emission and their coherent superposition to 
linear polarized emission, whereas the states J,= —2 and +2 are 
dark”. The exciton condensate is a four-component coherent state 
in general. The build-up of exciton coherence should manifest itself in 
an increase of the coherence length and of the degree of polarization of 
the exciton emission. The former phenomenon is general for both 
one- and multicomponent condensates’, whereas the latter is specific 
to multi-component condensates”. In this work, we report the 
emergence of both long-range spontaneous coherence of excitons 
and spontaneous polarization. A pattern of extended spontaneous 
coherence, measured by shift interferometry (see below), is correlated 
with a pattern of spontaneous polarization, measured by polarization- 
resolved imaging. These two experiments reveal the properties of a 
multi-component coherent state. 

The pattern of the first-order coherence function g,(6x) is measured 
by shift interferometry: the emission images produced by each of the 
two arms of the Mach-Zehnder interferometer (Fig. 1c) are shifted 
with respect to each other to measure the interference between the 
emission of excitons spatially separated by 6x. Details of the experi- 
ment are given in Supplementary Information. 

Extended spontaneous coherence is observed in the region of rings 
in the exciton emission pattern. Exciton rings—including the inner 
ring, external ring and localized bright spot (LBS) rings—were 
observed earlier’. The external and LBS rings form on the boundaries 
between electron-rich and hole-rich regions; the former is created by 
current through the structure (specifically, by the current filament at 
the LBS centre in the case of the LBS ring), whereas the latter is created 
by optical excitation”®”’”. The external and LBS rings are sources of cold 
excitons. In the area of these rings coherence forms spontaneously. 
Figure 1d shows a segment of the exciton emission pattern, with a 
section of the external ring and smaller LBS rings. 

The pattern of interference fringes is shown in Fig. le and the map of 
their amplitude, Ajnters in Fig. 1f. The quantity Ajnterr describes the 
degree of coherence of excitons, as detailed below. The regions of 
extended spontaneous coherence of excitons correspond to the green 
colour in Fig. 1f. 

Figure 2 presents the patterns of coherence of emission from indirect 
excitons in regions of an LBS and the external ring. The observed 
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Figure 2 | Coherence of indirect excitons in regions of an LBS and the 
external ring. a—d, A region of an LBS; e-h, a region of the external ring. 
Shown are the emission pattern (a, e), the interference pattern at shift 

dx = 2 um (b, f), the amplitude A inters(x, y) of interference fringes (c, g), and the 
linear polarization of exciton emission Pj, = (I. — I,)/(U + [y) (d, h). i, y-axis 
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properties of exciton coherence are qualitatively similar around both 
these sources of cold excitons. We first consider an LBS region. At low 
temperatures, a strong enhancement of Ainterr is observed at distance 
r~ 19 = 7 um away from the LBS centre (Fig. 2b, c, i). This enhance- 
ment of the degree of coherence shows the emergence of extended 
spontaneous coherence of excitons. 

Furthermore, the phase of the interference fringes, @inters experi- 
ences a shift at r ~ ro = 7 jim, which defines a phase domain boundary 
(Fig. 2b). The shift in phase correlates with the enhancement of Ainterr 
(Fig. 2b, c). Its magnitude 59= get"! — git". increases with 5x 
(Fig. 2j). The interference pattern in the shift-interferometry experi- 
ment with shift 6x can , be simulated using the formula 

Tintert =|¥(r) + e499 P(r 8x)|’, where q, = 21a/A sets the period of 
interference fringes (x is a small tilt angle between the image planes of 
the interferometer arms, 2 is the emission wavelength) and the 
complex function Y(r) represents the source amplitude at point 
r. For a flow of excitons with momentum q, Yr) =e", so 
that Tinterr = 2 + 2cos(qy + qdx) and the shift in the phase of the 
interference fringes means a jump in (average) measured exciton 
momentum 6q ~ Sdintert/Ox ~ 2 um”! at r= 1. 

Figure 2d presents a pattern of linear polarization around an LBS. It 
spatially correlates with the pattern of the amplitude and phase of the 
interference fringes: compare Fig. 2b, c and d. At r 2 ro a vortex of 
linear polarization with the polarization perpendicular to the radial 
direction is observed. Such polarization vortices appear owing to pre- 
cession of the Stokes vector for excitons propagating out of the LBS 
origin (see Supplementary Information for details). 

To summarize, close to the heating sources within the LBS ring the 
exciton gas is hot, and no spontaneous coherence forms there (the 
heating of the exciton gas is due to the current filament at the LBS 
centre and the binding energy released at the exciton formation in the 
ring”’). This is revealed by the small amplitude of the interference 
fringes. Excitons cool down with increasing distance r away from 
the heating sources so that they can approach the transition temper- 
ature to a coherent state. At r = ro, the (average) exciton momentum 
reduces and the coherence degree sharply rises, indicating the emer- 
gence of extended spontaneous coherence of excitons. This is revealed 
by the shift in the phase of the interference fringes and the strong 
enhancement of the amplitude of the interference fringes, respectively. 
The polarization vortex emerges along with extended spontaneous 
coherence at r= 1g, revealing the properties of a multicomponent 
coherent state. 
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cross-sections Of Iinter(X; y) at x = 2 um (black lines) and x = 12 um (red lines) 
at Thath = 0.1 K and 8 K. j, The shift in the phase of interference fringes in b at 
r~7 «um (black) and in f at ~4 jum away from the centre of the external ring 
(blue) versus 5x. Tpath = 0.1 K for a-h, j. 
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Figure 3 | First-order coherence function and distribution in momentum 
space. a, Measured | gi(8x) | for the polarization vortex (squares) and LBS 
centre (circles), and simulated |g,(5x)| for a quantum (black line) and classical 
(blue line) gas. b, Simulated |g,(8x)| for a quantum (black) and classical (blue) 


Similar phenomena are observed in the external ring region. At low 
temperature, the MOES forms in the external ring’ (Figs 1d and 2e) 
and a periodic polarization texture forms around the periodic array of 
beads in the MOES (Fig. 2h). Figure 2f and g shows the extended 
spontaneous coherence of excitons observed in the MOES. It emerges 
at low temperatures, along with the spatial order of exciton beads and 
periodic polarization texture. 

We now discuss the measurements of the first-order coherence func- 
tion. Coherence of the exciton gas is directly characterized by coherence 
of exciton emission, described by the first-order coherence function 
gi(6r). In turn, this function is given by the amplitude of the interfer- 
ence fringes Ainterr(Sr) in ‘the ideal experiment’ with perfect spatial 
resolution. In practice, the measured A inter(5r) is given by the convolu- 
tion of g, (Sr) with the point-spread function (PSF) of the optical system 
used in the experiment”. The PSF width corresponds to the spatial 
resolution of the optical system (~1.5 jtm in our experiments). 

The measurements of Ainter(5x) in the polarization vortex and in 
the LBS centre are presented in Fig. 3a. In the hot LBS centre, Ainterr 
quickly drops with 6x and the shape Ainterr(Ox) fits well to the PSF, 
which is shown by the blue line. In the polarization vortex, g,(6x) 
extends to large 5x, demonstrating extended spontaneous coherence. 
A fit to the experimental points computed using a model described 
below is shown by the black line (Fig. 3a). 

Figure 3b and c demonstrates the relation between the first-order 
coherence function and the particle distribution in momentum space. 
Figure 3b presents g,(6x) for a classical gas (blue dashed line) and for 
a quantum gas (black dashed line); both curves are for a spatially 
homogeneous gas of non-interacting particles with a quadratic disper- 
sion (see Supplementary Information). Both gases are at 0.1K; the 
occupation number of the q = 0 momentum state (1) is <1 for the 
classical gas, but is 5,000 for the quantum gas. The convolution of these 
gi (6x) with the PSF produces data shown as black and blue solid lines, 
which fit to Ajnter(Ox) in the spin polarization vortex and in the LBS 
centre, respectively (Fig. 3a, b). The Fourier transform of g,(6x) in 
Fig. 3b gives the momentum occupation factor ng shown in Fig. 3c. 

Figure 3b and c illustrates that a classical gas is characterized by a 
broad distribution in momentum space n, and a narrow first-order 
coherence function g;(r), whereas a quantum gas is characterized by a 
narrow n, and an extended g;(r). For a classical gas, g(r) reduces 
substantially within the thermal de Broglie wavelength gp, which 
scales x T ? and is about 0.5 jm at 0.1 K. The extension of gi (7) well 
beyond Agp indicates a coherent exciton state. 

Figure 3a also illustrates why 5x = 2 um is selected for mapping 
extended spontaneous coherence of excitons. The shift 6x = 2 tm is 
chosen to exceed both Agg and the PSF width. At such 6x, only weak 
coherence given by the PSF value at 6x = 2 um can be observed for a 
classical gas. The regions of enhanced coherence exceeding such a 
background level reveal the regions with extended spontaneous coher- 
ence of excitons. 

Next, we present a pattern of coherence length. The spatial exten- 
sion of g,(dr) can be characterized by a coherence length €. To consider 


g (um) 


gas with (solid) and without (dashed) convolution with the PSF. ¢, Distribution 
in momentum space obtained by the Fourier transform of g in b fora quantum 
(black line) and a classical (blue line) gas. See main text for details. 


all the points in the pattern on an equal footing, we evaluate ¢ as that 
value of dr at which the interference visibility drops e times. We mea- 
sured the exciton interference pattern at different dr to produce the 
spatial map of g,(6r) and, in turn, €. Figure 4a and b shows the pattern 
of € for the shift between the interfering excitons along x and y, 
respectively. Figure 4c and d presents the cross-sections of €(x, y) in 
the region of the polarization vortex (Fig. 4c) and MOES (Fig. 4d). 

The regions of a classical gas in the E(x, y) pattern correspond to the 
smallest observed coherence length, which is given by the PSF width. 
Long-range spontaneous coherence of excitons is observed in the 
polarization vortices and in the macroscopically ordered exciton state 
(Fig. 4). The coherence length in these regions is much larger than in a 
classical gas, indicating a coherent state with a much narrower-than- 
classical exciton distribution in momentum space, characteristic of a 
condensate. 

The observed coherence length in the polarization vortex exceeds 
Zap = 0.5 um at 0.1 K by more than an order of magnitude (Fig. 4). The 
coherence length in the MOES is smaller than in the polarization 
vortex. This may be related to fluctuations of the exciton density wave 
in the external ring. Such fluctuations were observed recently, and their 
studies will be reported in future work. 

The patterns of coherence length are different for the shifts along x 
and y, revealing a directional property of exciton coherence. In the 
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Figure 4 | Patterns of the coherence length of excitons, ¢(x, y). a,b, €(x, y) for 
a shift along x (a) and y (b). ¢, d, Cross-section of €(x, y) through LBS 

(y = —90 tum; c) and along the external ring (x = —70 jum; d) for a shift along x 
(red) and y (black). Tyan = 0.1K. 
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region of the polarization vortices, ¢ is higher in the direction along the 
shift between the interfering excitons (that is, the x direction for the dx 
shift (Fig. 4a), and the y direction for the dy shift (Fig. 4b)). In the 
region of the MOES, € is higher for the dx shift (that is, for the shift 
along the direction of exciton propagation away from the external ring 
in Fig. 4a). These data indicate that the extension of gi(r) is greater 
when the exciton propagation direction is along vector r. 

Finally, we present observations of phase singularities. A well 
known example of a phase singularity is a quantized vortex. Ina singly 
quantized vortex, the phase of the wavefunction winds by 27 around 
the singularity point, which can be revealed as a fork-like defect in a 
phase pattern. Fork-like defects in interference patterns have been 
reported for optical vortices, vortices in atom condensates, and polariton 
vortices (see refs 28-30 and references therein). 

We observed a number of fork-like defects in the interference pattern 
ofa cold exciton gas. For example, more than 20 such defects are present 
in Fig. le. Information on an enlarged scale is presented in Fig. 5. 
Figure 5a shows forks in the interference pattern, indicating the pres- 
ence of phase singularities. We have studied the properties of this phe- 
nomenon and show below that its origin is different from that of a 
quantized vortex. 

The forks in the interference pattern are observed at low tempera- 
tures in a quantum exciton gas (Fig. 5a) and vanish at high tempera- 
tures in a classical exciton gas (Fig. 5d). At low temperatures, a closed 
contour around the fork crosses an odd number of interference fringes, 
so the phase of the interference fringes winds by 21, indicating a phase 
singularity (Fig. 5e). Similar properties are observed for quantized 
vortices. 

However, the distance between the left- and right-facing forks in the 
interference pattern is different from the shift 6x = 2 jim in the shift- 
interferometry experiment, and depends on the excitation power 
(Fig. 5a-c). This indicates the difference between the observed phase 
singularity and a quantized vortex. Indeed, straightforward simulations 
show that a quantized vortex is characterized by a pair of opposite forks 
separated by a distance equal to the shift in the shift-interferometry 
experiment (see Supplementary Information). 

Simulations of the interference pattern produced by a ring-shaped 
source, such as an LBS ring, result in an interference pattern with 
opposite forks separated by a distance much larger than 6x, in qualita- 
tive agreement with the experiment (see Supplementary Information). 
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Figure 5 | Fork-like defects in exciton interference patterns. 

a-d, Interference pattern Intex, y) around an LBS; for a—c, Tyan is constant 
(0.1 K) and P.,. varies (7, 1.2 and 0.7 mW, respectively); for a and d, P..x is 
constant (7 mW) and Tyatn varies (0.1 and 5K, respectively). e, Tinterr along a 
closed contour (shown in inset) for T,ath = 0.1 K (black) and 8 K (red). 

dx = 2 um. 
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A ring-shaped source with particles propagating away from their origin 
on the ring produces a more complicated phase pattern than a vortex, 
yet both objects are characterized by the spreading of particle velocities 
over all directions. The observed phase singularities constitute the 
properties of a quantum exciton gas with extended spontaneous coher- 
ence (Fig. 5a), and no such phase singularity is observed at high tem- 
peratures in a classical gas (Fig. 5d). 


METHODS SUMMARY 


Experiments are performed on an* —i—n* GaAs/AlGaAs CQW structure. The i 
region consists of a single pair of 8-nm GaAs quantum wells separated by a 4-nm 
Alo.33Gao.67As barrier and surrounded by 200-nm Alp.33Gao¢7As barrier layers. 
The n° layers are Si-doped GaAs with Si concentration Ns = 5 X 10'7cm *. The 
electric field in the z direction is controlled by the external voltage applied between 
n* layers. The small in-plane disorder in the CQW is indicated by the emission 
linewidth of 1 meV. Long lifetimes of the indirect excitons allow them to cool to 
temperatures within about 0.1 K of the lattice temperature, which can be lowered 
to 0.1K in an optical dilution refrigerator. This allows the realization of a cold 
exciton gas with temperature well below the temperature of quantum degeneracy, 
which is in the range of a few kelvin for typical exciton densities ~10'° cm ? for 
the CQW’. The laser excitation is performed by a HeNe laser at excitation wave- 
length /.. = 633 nm with an excitation power P. = 1.2 (2.9) mW for the data in 
Figs 1-3 (Fig. 4). The photoexcitation is more than 400 meV above the energy of 
indirect excitons, and the 10-j1m-wide excitation spot is farther than 80 um away 
from both the LBS and the external ring. The x-polarization is along the sample 
cleavage direction within the experimental accuracy. The data have been acquired 
on a timescale of the order of 100s. During the measurements, the LBS rings are 
static while the exciton density wave in the external ring fluctuates ona length scale 
of few micrometres (such fluctuations may be responsible for a smaller coherence 
length in the MOES than in the polarization vortex, as discussed in the text). The 
coherence length is generally smaller than the distance between the LBS and 
MOES, or between different LBS. The spatial resolution of the optical system is 
-=).5 pm, 
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Endospore abundance, microbial growth and 
necromass turnover in deep sub-seafloor sediment 


Bente Aa. Lomstein', Alice T. Langerhuus', Steven D’Hondt’, Bo B. Jorgensen® & Arthur J. Spivack? 


Two decades of scientific ocean drilling have demonstrated wide- 
spread microbial life in deep sub-seafloor sediment, and surprisingly 
high microbial-cell numbers. Despite the ubiquity of life in the deep 
biosphere, the large community sizes and the low energy fluxes in 
this vast buried ecosystem are not yet understood’. It is not known 
whether organisms of the deep biosphere are specifically adapted to 
extremely low energy fluxes or whether most of the observed cells are 
in a dormant, spore-like state’. Here we apply a new approach—the 
D:L-amino-acid model—to quantify the distributions and turnover 
times of living microbial biomass, endospores and microbial 
necromass, as well as to determine their role in the sub-seafloor 
carbon budget. The approach combines sensitive analyses of unique 
bacterial markers (muramic acid and D-amino acids) and the 
bacterial endospore marker, dipicolinic acid, with racemization 
dynamics of stereo-isomeric amino acids. Endospores are as abund- 
ant as vegetative cells and microbial activity is extremely low, lead- 
ing to microbial biomass turnover times of hundreds to thousands 
of years. We infer from model calculations that biomass production 
is sustained by organic carbon deposited from the surface photo- 
synthetic world millions of years ago and that microbial necromass 
is recycled over timescales of hundreds of thousands of years. 

Deep sub-surface sediment material was obtained from the eastern 
tropical Pacific during the Ocean Drilling Program (ODP) Leg 201 
expedition with DS JOIDES Resolution. Coring sites ranged from the 
continental shelf off the coast of Peru to ocean depths of 5,000 m. The 
expedition recovered sediment at depths of up to 420 metres below the 
sea floor (mbsf), and this sediment was found to be up to 35 million 
years old. 

Wereport estimates of high bacterial endospore numbers in the order 
of 10’ endospores per cm? (Fig. 1) in deep sediment and sediment that is 
up to 10 million years old. We used two different analyses to quantify 
endospores: muramic acid, which is a unique building block in cell walls 
of both bacteria and endospores; and dipicolinic acid, which is uniquely 
formed by endospores. In the first analysis, muramic acid of endospores 
is calculated as the total muramic acid (Supplementary Fig. 1) minus 
muramic acid from vegetative cells. Vegetative cells (and intact but 
recently dead cells) were quantified by acridine orange direct counts 
(AODCs)*. Endospore muramic acid is converted to endospore 
numbers by the use of cell-specific conversion factors that are obtained 
from the literature (Supplementary Information). It remains unclear 
whether bacteria or archaea predominate in the studied sediment>®. 
We therefore use two extreme scenarios for our estimates of endospore 
abundance, either that bacteria completely dominate® (bacterial 
dominance) or that 90% are archaea’ (archaeal dominance). Archaea 
do not contain muramic acid in the cell wall. We also assume that 35% 
of the bacteria are Gram positive and 65% are Gram negative, as was 
found throughout the sediment column at Site 1229 (ref. 7). 

Muramic-acid-based endospore numbers are 0.2 X 10’ to 3 X 10” 
endospores per cm*. Archaeal or bacterial dominance did not affect 
the estimated endospore numbers because muramic acid levels 
calculated from AODC only marginally contributes to the measured 


muramic acid concentrations (Fig. 1). At Site 1227 we also estimated 
endospore numbers from dipicolinic acid concentrations assuming an 
average dipicolinic-acid content of 2.2 x 10° '®molspore’' (ref. 8). 
Dipicolinic-acid-based endospore numbers are 0.3 X 10’ to 1.0 X 10” 
endospores per cm’ and confirmed the muramic-acid-based estimates 
by a mean deviation factor of 4.6. There was a significant positive 
correlation between the two endospore estimates from muramic acid 
and dipicolinic acid as judged from a statistical analysis of their depth 
trend (regression of muramic acid and dipicolinic acid; P = 0.0008; 
R= 0.7297). To our knowledge, this is the first time that such com- 
parative data are published for endospores in environmental samples. 

This discovery of high endospore abundances raises the question of 
why endospores have not been detected previously and whether total 
cell abundance in the deep biosphere, including endospores, has been 
globally underestimated. Previous studies of the deep biosphere did not 
quantify endospores. Endospores are unlikely to be stained by fluor- 
escent DNA dyes such as acridine orange”’® or by ribosomal RNA 
staining techniques such as catalysed reporter deposition fluorescence 
in situ hybridization because endospore walls are impermeable’. In this 
study we processed samples with 3 N HCl (for 4h at 95 °C), which com- 
pletely extracts muramic acid and dipicolinic acid from endospores"’. 
Our results suggest that endospores are as abundant as vegetative 
prokaryotes in this deep marine biosphere. Studies in other oceano- 
graphic regions are needed to clarify how the earlier global estimate of 
3.5 X 10°” cells’? should be adjusted to account for this. 
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Figure 1 | Profiles of AODCs and estimated endospore numbers on the 
Peruvian continental shelf (sites 1227 and 1229) and in the Trench in Peru 
(site 1230). Filled circles, AODC’; coloured open squares, muramic-acid- 
based estimated number of endospores (bacterial dominance); crosses, 
muramic-acid-based estimated number of endospores (archaeal dominance); 
black open squares, dipicolinic-acid-based estimated number of endospores 
(n = 6). Grey shading, sulphate—-methane transition zones. Error bars, s.d. 
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It has been suggested that bacterial necromass production is 
responsible for enhanced organic matter preservation in highly pro- 
ductive regions such as the upwelling system off the coast of Peru’. 
Hence, whether microbial necromass is a major component’? or an 
insignificant component" of sedimentary organic matter is an import- 
ant question. We address this question by comparing the total mea- 
sured amino acid carbon with the amino acid carbon estimated from 
vegetative cells and endospores, based on published cellular protein 
contents (Supplementary Information) and AODCs. Independent of 
whether bacteria or archaea dominate the microbial community, 
vegetative cells and endospores contribute less than 3% and 1% to 
amino acid carbon, respectively (Supplementary Fig. 2). As expected, 
microbial necromass accounts for the remaining >96% of amino acid 
carbon. The concentration of bulk organic matter progressively 
decreases with sediment depth and age. Amino acids contribute 
4-5% to the total organic carbon in the youngest sediment and 
<0.1% in sediment that is several millions of years old (Supplemen- 
tary Fig. 3), thus showing the preferential degradation of amino acids 
compared to bulk organic matter. Hence, the down-core decrease in 
amino acid concentrations is the result of a dynamic amino acid turn- 
over in which amino acid degradation slightly exceeds amino acid 
formation. Our data suggest that the original amino acids that are 
derived from the water column are degraded soon after deposition 
and that sub-surface amino acids are produced in situ by new micro- 
bial biomass formation. 

Almost all of the amino acids in living cells are present in the 
L-isomeric configuration. The only known exception is in bacteria, 
as they uniquely produce specific D-amino acids for incorporation into 
their cell-wall complex'’. In marine sediment there is an additional 
source of D-amino acids derived from racemization, which is the slow 
stereo-chemical conversion of L-amino acids to D-amino acids and 
vice versa (see Supplementary Information). Bacterial cultures isolated 
from the deep sub-seafloor have been screened for D- and L-forms of 
aspartic acid. The D:L-Asp ratios are relatively constant with an average 
of 0.09 (s.e.m. = 0.02) (Supplementary Table 2). The half-life of abiotic 
chemical racemization (D:L = 0.33 is the ratio obtained when chemical 
racemization is half way to unity’®) is 0.1-0.4 million years for Asp at 
the in situ temperature (Fig. 2a and Supplementary Information). 

D:L-Asp ratios for sites 1227, 1229 and 1230 are higher than D:L-Asp 
ratios in cultures and increase in sediment that is older than 4 million 
years (Fig. 2a). This indicates that D-Asp is produced both biologically 
and by chemical racemization in sediment that is up to 10 million years 
old. Thus, the turnover time of Asp in the sediment is of a similar 
magnitude to the half-life for chemical racemization between L-Asp 
and p-Asp. 


Constant low D:L ratios of Asp have been reported previously from 
vent systems’” and Holocene-epoch sediments”. In those studies low 
D:L ratios were considered evidence of bacterial activity, but the data 
were not used quantitatively. 

We present a D:L model from which the turnover time of amino 
acids in living cells and microbial necromass, as well as their role in 
sedimentary carbon cycling, can be quantified (Fig. 2b; see the 
mathematical model in Supplementary Information). Inputs to the 
model are the measured mean D:L-Asp ratios in cultured bacteria from 
the deep biosphere (Fig. 2a and Supplementary Table 2), the fraction of 
Asp associated with living microorganisms (Supplementary Informa- 
tion), and the racemization half-life of the Asp'®. We use Asp because 
the D:L-Asp ratio in bacterial cultures varies within a limited range 
(Fig. 2a) and because all of our measured D:L-Asp ratios deviate from 
the average ratios in bacterial cultures. Such a deviation is a prerequisite 
for accurate model calculations of turnover time. 

The primary output of the D:L model is the turnover time of micro- 
bial necromass, which can be used to model microbial biomass turn- 
over time (Supplementary Information equation (25)). A quasi-steady 
state in microbial biomass is assumed because AODC cell numbers are 
relatively constant within the 5-m intervals used in the model (Fig. 1). 
Total carbon oxidation rates of amino acids and uncharacterized total 
organic carbon, and the total carbon incorporation into microbial 
biomass (Fig. 2b), are estimated from necromass degradation rates 
and an assumed growth yield (carbon incorporation efficiency). In 
our calculations we use an 11% growth yield, which is the average 
growth yield for anaerobic heterotrophic bacteria’’. At steady state, 
when microbial biomass production is equal to necromass degradation 
(Supplementary equation (5)), carbon must be incorporated into 
vegetative cells from uncharacterized total organic carbon to maintain 
balance (Fig. 2b). All model estimates are performed for two extremes 
in microbial community composition; that is, for bacterial dominance 
and archaeal dominance. 

The biomass turnover time, T,, of vegetative cells is equal to cell 
carbon divided by the rate of cellular carbon incorporation. We use 
biomass-turnover time instead of generation time, because we do not 
know whether the cells are dividing (as indicated by the term genera- 
tion time) or whether they are turning over their biomass and persist 
without dividing. The T, values of vegetative cells estimated from the 
D:L model are less than 0.2 X 10° to 4.0 X 10° years, irrespective of 
whether bacteria or archaea dominate (Fig. 3a). These results are 
supported by estimates of the T;, of sulphate-reducing bacteria (SRB; 
less than 0.1 X 10° to 1.5 X 10° years) at sites 1227 and 1229 (with one 
exception at site 1229 at which the T, is 12 X 10° years; Fig. 3b). The T, 
of SRB is estimated from sulphate-reduction rates modelled from 
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Figure 2 | p:L-Asp model. a, b, Turnover of amino acids in microbial biomass 
and necromass are quantified by application of the time constant (k;) for 
chemical racemization of Asp in necromass and the known constant D:L-Asp in 
bacteria. a, D:L-Asp versus assigned age of the sediment (Supplementary 
Information): orange, site 1227; blue, site 1229; green, site 1230. Mean D:L-Asp 
in bacterial cultures (dashed line in the shaded box; Supplementary Table 2) 
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Uncharacterized 
organic matter 


and s.e.m. (shaded box; n = 7). Curves show D:L-Asp generated by chemical 
racemization at the highest (solid line) and lowest (dotted line) temperatures 
encountered. b, Visualizes additional outputs of the D:L model: necromass 
production, necromass degradation, carbon oxidation (C-ox) and biomass 
production rates. 
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Figure 3 | D:L-model estimates of microbial biomass turnover times, 

necromass turnover times and carbon-oxidation rates. a, Microbial-biomass 
turnover times . b, Comparison between D:L-modelled biomass turnover times, 
and modelled biomass turnover times of sulphate reducing bacteria (crosses). 


sulphate profiles and a 13% contribution of SRB to the microbial 
community in the sulphate zone’ (Supplementary Information). 
The two independent estimates of T,,, from D:L modelling and from 
sulphate modelling, show almost identical down-core changes at sites 
1227 and 1229 (Fig. 3b), and this underscores the capability of our new 
D:L modelling approach. At site 1230 we have only two estimates of Ty, 
based on sulphate reduction and they were considerably shorter than 
the D:L modelled T,, (data not shown). 

Previous estimates of microbial generation times in deep sub- 
seafloor sediment are in the range of 1 X 10° to 2 X 10° years (ref. 
12). These estimates are based on the rates of organic carbon supply 
and an assumed 20% growth yield. If an 11% growth yield is applied, as 
in our D:L model, this gives longer generation times of 2 X 10° to 
3X 10° years, and these generation times are still consistent with 
results of the present study. Irrespective of the approach used to estimate 
mean generation times in the deep sub-seafloor, these times are far 
longer than the generation times found in any other ecosystem’. 

Sub-seafloor necromass is turned over on timescales that are100- 
fold longer than the turnover of living biomass. Necromass turnover 
times are in the range of 0.4 X 10° to 4.0 X 10° years, irrespective of 
whether bacteria or archaea are assumed to dominate. This is still 10- 
100-fold shorter than the age of the oldest sediment (6.9 million years; 
Fig. 3c) and is consistent with our assumption of a quasi-steady state. It 
has been suggested that bacterial necromass may be an important 
source of recalcitrant, and hence preserved, organic matter in sedi- 
ments of high-productivity regions'*”'”’. This apparent contradiction 
between previous studies and our results is probably due to differences 
in timescales among studies, which range from less than a few thousand 
years in earlier studies'**’” to several millions of years in the present 
study. Hence, preservation of sub-seafloor necromass ona shorter time- 
scale does not preclude efficient mineralization on a longer timescale. 


Table 1 | Estimated carbon oxidation rates at ODP Leg 201 sites 
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c 
Microbial necromass 


turnover time (yr) 
10+ 105 10° 


Carbon oxidation 
(nmol cm-3 yr-) 


2 4 6 


c, Necromass turnover times. d, Carbon-oxidation rates. Orange, site 1227; 
blue, site 1229; green, site 1230. Open circles, bacterial dominance; filled 
triangles, archaeal dominance of the microbial community (the two symbols 
overlap). 


Atall sites, p:L-Asp-modelled carbon oxidation rates decrease steeply 
in the upper part of the sediment (Fig. 3d). In the deeper part of site 1230 
(70-200 mbsf), carbon oxidation rates remain relatively high compared 
to sites 1227 and 1229 (Fig. 3d). Site 1230 is located in the subduction 
zone and differed from the two other sites by the presence of gas 
hydrates at depths below 70 mbsf (ref. 23). The accumulation of 
methane hydrate and the more compressed sulphate-methane zona- 
tion at site 1230 compared to the other sites indicate that site 1230 has a 
higher carbon and energy flux in the deeper parts of the sediment. 

Areal carbon oxidation rates in the sulphate zone are 1-9 mol 
cm” yr ' (irrespective of bacterial or archaeal dominance) based 
on D:L modelling. These rates are remarkably similar to independently 
determined carbon oxidation rates based on sulphate profiles’ (2-5 
mol cm * yr_'; Table 1). This supports the validity of the p: model 
and gives confidence that the model is reliable also in zones below the 
depth of sulphate depletion. 

The growth yields of the sulphate-reducing community can be 
estimated from biomass-production rates derived from D:L modelling, 
a 13% contribution of sulphate-reducing bacteria to the microbial 
community”® and the carbon flux (carbon oxidation rate plus carbon 
incorporation into microbial biomass) to the sulphate-reducing 
bacteria, modelled from sulphate profiles (Supplementary equation 
(27)). The resultant average growth yield for sulphate-reducing 
bacteria is 13 + 8%, which is in agreement with the average growth 
yield of 11% derived from data in the literature and applied to the 
entire bacterial community. At the exceedingly low metabolic rates 
found in the present study (Fig. 3) it is unknown whether this growth 
yield leads to net growth and cell division or instead to turnover of the 
microbial-cell mass without division. 

It is generally assumed that microbial communities in deep sub- 
surface sediments primarily rely on energy from the photosynthetic 


ODPLeg201 Water Relevant depth Potential C-ox by Integration depth p:L-modelled C-oxin D:L modelled C-ox Integrationdepth Total p:L-modelled Total p:L-modelled 

site depth (m) interval for net sulphate (D:L-modelled sulphate zone in sulphate zone (total D:.-modelled — C-ox (bacterial C-ox (archaeal 
sulphate reduction? C-ox in sulphate (bacterial (archaeal C-ox) (mbsf) dominance)* dominance)+ 
reduction (umol cm~? yr) zone (mbsf)) dominance)* dominance) + (umol cm~? yr“) (umol cm~? yr!) 

(mbsf) (umol cm~? yr~1) (mol cm~2 yr-1) 

1227 427 1-34 Ls 5-35 3.9 4.0 5-85 6.6 6.7 

1229 151 1-30 18¢ 5-30 8.9 9.0 5-80 18.1 18.4 

1230 5,086 1-10 5.0 -10 1.2 12 5-200 21.3 214 


*It is assumed that all living cells belong to Bacteria 
+ lt is assumed that 10% of all living cells belong to the Bacteria and 90% to the Archaea 


{At site 1229 carbon oxidation (C-ox) by sulphate reduction was assumed to be similar to that at site 1227 because sulphate penetrated to approximately the same depth at the two sites. 
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surface world’. We estimate the minimum turnover time of the buried 
total organic carbon pool by dividing the total organic carbon concen- 
tration by the D:L-modelled carbon oxidation rates at all depths and 
sites. The total organic carbon turnover time is less than 43 million 
years, irrespective of whether bacteria or archaea dominate the microbial 
community. The total organic-carbon turnover times increase with 
approximately an order of magnitude from the upper and youngest 
sediment layers to the base of the cores. This suggests that buried organic 
carbon is indeed sufficient to fuel microbial activities over timescales of 
millions of years. 


METHODS SUMMARY 

Sediment handling and analysis. Samples were obtained from sites 1227, 1229 
and 1230 of ODP Leg 201 (ref. 23). Sub-samples were taken from the central part 
of the cores immediately after these were brought on deck. Samples were stored at 
—20°C until analysis. Concentrations of muramic acid, D- and L-Asp, and indi- 
vidual amino acids were analysed by high-performance liquid chromatography 
(HPLC) following the method described in a previous study” with the exception 
that in cases in which one of the samples was spiked, muramic acid was analysed 
twice in each sample. Dipicolinic-acid concentrations were analysed by HPLC and 
calibrated by standard additions to samples”. Total organic carbon was deter- 
mined in a Carlo Erba analyser after removal of carbonates by acidification with 
H2SO; (5-6% w/w). Total organic carbon was estimated from five-point standard 
curves and flour containing 44.39% carbon was used as a standard. 
Cell-specific concentrations of D- and L-amino acids. Seven pure cultures, iso- 
lated from ODP Leg 201, were analysed for D- and L-Asp. Cultures were washed 
twice in saline before acid hydrolysis to remove amino acids from the growth 
medium. Cultures (2 ml) were added to 2 ml of 12 moll ' HCl and hydrolysed 
at 105 °C for 24h under N>. Concentrations of the respective D- and L-amino acids 
were analysed as described earlier. Cell numbers in the different cultures were 
determined by epifluorescence microscopy after DNA staining with 4'-6- 
diamidino-2-phenylindole. 

Standardization of data. Samples for the present study were not obtained from 
exactly the same depths as AODC*. Hence, concentrations were read from profiles 
at the depth of AODC to make comparisons to AODC. No attempt was made to 
standardize profiles in the lower part of site 1229 because our samples did not 
cover the depths of elevated AODC. Moisture and density data were obtained from 
the IODP database (http://iodp.tamu.edu/janusweb/physprops/maddat.cgi). 
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Wild-type microglia arrest pathology in a mouse 


model of Rett syndrome 


Noél C. Derecki’*, James C. Cronk, Zhenjie Lu', Eric Xu, Stephen B. G. Abbott?, Patrice G. Guyenet? & Jonathan Kipnis?? 


Rett syndrome is an X-linked autism spectrum disorder. The disease 
is characterized in most cases by mutation of the MECP2 gene, which 
encodes a methyl-CpG-binding protein’°. Although MECP2 is 
expressed in many tissues, the disease is generally attributed to a 
primary neuronal dysfunction®. However, as shown recently, glia, 
specifically astrocytes, also contribute to Rett pathophysiology. Here 
we examine the role of another form of glia, microglia, in a murine 
model of Rett syndrome. Transplantation of wild-type bone marrow 
into irradiation-conditioned Mecp2-null hosts resulted in engraft- 
ment of brain parenchyma by bone-marrow-derived myeloid cells of 
microglial phenotype, and arrest of disease development. However, 
when cranial irradiation was blocked by lead shield, and microglial 
engraftment was prevented, disease was not arrested. Similarly, 
targeted expression of MECP2 in myeloid cells, driven by Lysm“° 
on an Mecp2-null background, markedly attenuated disease symp- 
toms. Thus, through multiple approaches, wild-type Mecp2- 
expressing microglia within the context of an Mecp2-null male 
mouse arrested numerous facets of disease pathology: lifespan was 
increased, breathing patterns were normalized, apnoeas were 
reduced, body weight was increased to near that of wild type, and 
locomotor activity was improved. Mecp2*'~ females also showed 
significant improvements as a result of wild-type microglial engraft- 
ment. These benefits mediated by wild-type microglia, however, 
were diminished when phagocytic activity was inhibited pharmaco- 
logically by using annexin V to block phosphatydilserine residues 
on apoptotic targets, thus preventing recognition and engulfment 
by tissue-resident phagocytes. These results suggest the importance 
of microglial phagocytic activity in Rett syndrome. Our data 
implicate microglia as major players in the pathophysiology of this 
devastating disorder, and suggest that bone marrow transplanta- 
tion might offer a feasible therapeutic approach for it. 

The role of glia in Rett syndrome has recently been recognized’ °. 
Mecp2-null astrocytes were unable to support the normal dendritic 
ramification of wild-type neurons growing in culture’, and expression of 
wild-type MECP2 protein in astrocytes of Mecp2-null hosts dramatically 
ameliorated disease pathology’. Mecp2-null microglia were reported 
to be toxic to neurons in vitro through production of high levels of 
glutamate’®. 

Microglia, the brain-resident macrophages, are of haematopoietic 
origin". Still, the idea of repopulation of brain microglia from bone- 
marrow-derived cells in adult mice under normal physiological 
conditions is controversial’. However, when bone marrow trans- 
plantation is preceded by irradiation-mediated immune ablation, 
bone-marrow-derived cells with microglia-like morphology and 
phenotype (expressing low levels of CD45 and high levels of CD11b) 
are found in the brain'*"*. Microglia have received increasing attention 
in the pathophysiology of several neurodegenerative and neuro- 
psychiatric diseases'*”"*. 

We first studied microglial function in the context of Mecp2~” male 
mice. Males possess a single mutant X chromosome, and thus manifest 


a severe phenotype that includes markedly retarded growth, apnoeas, 
tremor, impaired gait and locomotor function, and a postnatal life 
expectancy of approximately 8 weeks*’? (Fig. 1a, b and Supplemen- 
tary Movie 1). 

To address the role of hematopoietically derived cells in the  patho- 
physiology of Rett, Mecp2~”” (Mecp2'""" and Mecp2'"?®'"") mice 
were subjected to lethal split-dose irradiation at postnatal day (P)28 
(the approximate age at which neurological signs appear*). Mice were 
then injected intravenously with syngeneic bone marrow from C57B1/6J 
mice ubiquitously expressing green fluorescent protein (GFP). Control 
groups were injected with autologous (Mecp2-”) bone marrow, or left 
naive. Surprisingly, the lifespan of Mecp2-mull recipients of wild-type 
bone marrow (wild type — Mecp2 ’”) was significantly extended 
compared with Mecp2~”” mice receiving autologous bone marrow 
(Mecp2-'” > Mecp2~”) or to naive Mecp2-”” mice (Fig. 1b and Sup- 
plementary Movie 2). Our oldest living wild-type > Mecp2~”” mice 
were over 48 weeks of age (Supplementary Movie 3); most experimental 
mice were euthanized at the age of ~16 weeks for tissue analysis. 

Although Mecp2~’” mice on the C57BI/6J background are under- 
sized*"’, within 4 weeks of transplantation, wild-type > Mecp2 ¥ (but 
not Mecp2 '” > Mecp2 '”) mice approached the size of wild-type 
littermates (Fig. 1c, d). Wild-type > Mecp2-” mice also exhibited 
an increase in brain weight (Fig. le, f), which was probably caused 
by general growth of the mouse, because the reduced soma size char- 
acteristic of Mecp2-null neurons was not changed by bone marrow 
transplantation (Fig. 1g, h) and the brain to body weight ratio was 
normalized (Supplementary Fig. 1a). Spleens of Mecp2 ’” mice were 
also smaller than normal and their size normalized after transplanta- 
tion (Supplementary Fig. 1b-d). 

Growth retardation is a characteristic feature of Rett pathology. Along 
these lines, treatment with insulin-like growth factor (IGF)-1 benefits 
survival and behavioural outcomes in Mecp2-null mice’. Indeed, 
peripheral macrophages from wild-type mice expressed significantly 
higher levels of IGF-1 in vitro in response to immunological stimuli than 
macrophages from Mecp2-null (Mecp2°"'*”) mice (Supplementary 
Fig. 2); this difference may contribute to the increased body growth seen 
in Mecp2-null mice after bone marrow transplantation. 

The general appearance of wild-type > Mecp2™'” mice was 
improved compared with that of naive Mecp2-’” or Mecp2.'” > 
Mecp2~”” mice. The severe involuntary tremors normally seen in 
mutant mice were absent following wild-type bone marrow trans- 
plantation (Fig. 2a), and gait was improved. Interestingly, no detect- 
able benefit on hindlimb clasping phenotype was observed. 

Breathing irregularities and apnoeas are cardinal signs of Rett 
syndrome. We used whole-body plethysmography to compare the 
breathing patterns of Mecp2~”” mice with or without bone marrow 
transplantation to those of control mice (Fig. 2b). As expected, 
Mecp2-’” mice developed apnoeas progressively with age (data not 
shown). However, wild type—> Mecp2 ” exhibited significantly reduced 
apnoeas and fewer breathing irregularities than either naive Mecp2-”” or 
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Figure 1 | Wild-type bone marrow transplantation (BMT) at P28 arrests 
disease progression in Mecp2 '’ mice. a, Representative images of wild-type 
and Mecp2 ” littermates at P56. b, Lifespan of Mecp2’” mice receiving wild- 
type bone marrow at P28 (wild type—> Mecp2~’; n = 15) compared with naive 
Mecp2 ” (n= 17), Mecp2 ” receiving Mecp2 ” bone marrow (Mecp2 ” > 
Mecp2”; n = 9) and wild-type mice receiving wild-type bone marrow (wild 
type > wild type; n = 29) (***P < 0.0001, log rank (Mantel-Cox)). 

c, Representative images of wild-type > wild-type compared with wild-type > 
Mecp2.”” mice are shown at P56 (4 weeks after bone marrow transplantation). 
d, Weights (mean + s.e.m.) of wild-type —> wild-type, Mecp2 ”, Mecp2 '” > 
Mecp2 '” and wild-type + Mecp2 ” mice (n = 15, 15, 7, 15 mice per group) 
are shown over time. e, Representative images of brains isolated from P56 wild- 
type > wild-type and wild-type > Mecp2 ’” mice transplanted at P28 and 
naive Mecp2 ” mice. f, Brain weight (mean + s.e.m.) for each group 

(***P < 0.001; one-way analysis of variance (ANOVA); n = 4 each group). 

g, Representative images of Nissl staining in hippocampal slices (CA1 area) of 
wild-type and Mecp2 '” mice (scale bar, 40 um). h, Soma area (mean + s.d.) of 
CA1 hippocampal cells. For each group of mice, a set of cells was created by 
randomly selecting 100 cells per mouse, three mice per group (***P < 0.001; 
two-way ANOVA with Bonferroni post hoc test). 


Mecp2-'” > Mecp2~”” mice (Fig. 2c, d). Wild-type > Mecp2~” mice 
also displayed significantly increased mobility in the open field com- 
pared with naive Mecp2-” or Mecp2'” > Mecp2-"” mice (Fig. 2e, f). 

We also performed bone marrow transplantation in heterozygous 
female mice at 2 months of age, and animals were examined at 9 
months. The disease in Mecp2*'~ mice develops slowly, with 
behavioural abnormalities becoming clear at 4-6 months of age. 
Weights of treated Mecp2*/~ mice were comparable to wild-type 
controls (Fig. 2g). Moreover, there was significant improvement in 
motor function, as examined on rotarod (Fig. 2h), and time spent in 
the centre of the open field arena was significantly increased (Fig. 2i). 
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Figure 2 | Bone marrow transplantation effects on general appearance, 
breathing and locomotion of Mecp2~'” and Mecp2*’~ mice. a, Neurological 
scores at P56 for wild-type wild-type, Mecp2 '” naive, Mecp2 '”—> Mecp2 
and wild-type > Mecp2 ”” mice. Behaviours (mean + s.e.m.) are scored as 
indicated in Methods (***P < 0.001; one-way ANOVA; n1 = 16, 16, 7, 16); NS, 
not significant; a.u., arbitrary units. b, Left, representative plethysmograph 
recordings of animals from each group; right; expiratory time (Tg) for 
representative wild-type, Mecp2~”, Mecp2_” > Mecp2~”” and wild-type > 
Mecp2~” mice (transplantation at P28 and examination at indicated ages for all 
groups) as measured over 5 min; Ty is normalized to mean breath duration for 
each mouse. c, Apnoeas (mean = s.e.m.) per 30 min as measured in all four 
groups (***P < 0.001; one-way ANOVA with Bonferroni post hoc test; n = 5 
mice per group; for the entire figure, all mice were age P56 except for wild type—> 
Mecp2 "” at 12 weeks of age; that is, 8 weeks after bone marrow transplantation). 
d, Interbreath irregularity (mean percentage + s.e.m.) calculated as absolute 
((Trotahn — Ttotain + 1)/Ttotain +1) for all four groups (**P < 0.01; ***P < 0.001; 
one-way ANOVA with Bonferroni post hoc test; n = 5 mice per group). 

e, Distance travelled (mean + s.e.m.) in an open field (*P < 0.05; ***P < 0.001; 
one-way ANOVA, n = 5 mice per group). f, Representative traces of the path 
travelled by mice in an open field during 20 min test time. g-k, Mecp2‘’~ mice 
were transplanted with wild-type bone marrow at P56 and were examined for 
disease symptoms at 9 months of age. g, Weight (mean + s.e.m.); h, latency to 
fall (mean = s.e.m.) in the rotarod task; i, time (mean = s.e.m.) spent in the 
centre of the open field; j, apnoeas (mean + s.e.m.) measured by whole-body 
plethysmography in 30 min; k, interbreath irregularity (mean % ~ s.e.m.): all 
were improved in the treated mice compared with non-treated controls 

(*P < 0.05; ***P < 0,001; one-way ANOVA, n = 6 mice per group; post hoc 
Bonferroni test was used for individual comparisons). 


Apnoeas in bone marrow transplanted mice were reduced (Fig. 2j) and 
their overall breathing patterns were improved compared with their 
non-treated controls (Fig. 2k). 

The peripheral immune system of Mecp2 ”” hosts was repopulated 
by donor bone marrow (Supplementary Fig. 3a). Additionally, it has 
been previously shown that bone marrow transplantation after whole- 
body irradiation results in engraftment of microglia-like myeloid cells 
into the brain parenchyma”. Indeed, GFP™ cells in the parenchyma of 
bone marrow transplanted mice expressed CD11b (Fig. 3a) but not 
GFAP or NeuN (data not shown). Twelve weeks after bone marrow 
transplantation, mice were perfused and their brains dissected into 
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sub-areas, prepared in single-cell suspensions and analysed using flow 
cytometry to determine percentages of bone-marrow-derived (GFP ” ) 
cells in the haematopoietic (CD45*) cell fractions in the brain 
(Supplementary Fig. 3b, c). 

Interestingly, in mice in which bone marrow transplantation was 
performed later (P40 or P45), only slight improvements in disease 
pathology were observed (Supplementary Fig. 4a). No microglial 
engraftment was evident, although substantial numbers of GFP* cells 
were found in the meningeal spaces (Supplementary Fig. 4b). These 
results may suggest that when disease progression is faster than micro- 
glial engraftment, full rescue cannot be achieved. The moderate results 
observed, however, may have been due to a yet-unknown mechanism, 
perhaps through soluble factors produced by meningeal immune cells, 
or peripherally-expressed IGF-1 (Supplementary Fig. 2). When bone 
marrow transplantation was performed at P2 without irradiation, 
minimal peripheral chimaerism was achieved without detectable 
microglial engraftment and no lifespan extension was observed 
(Supplementary Fig. 4a, c). 

Newly engrafted microglia expressed detectable levels of wild-type 
Mecp2 (data not shown) but nearby cells did not show any Mecp2 
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labelling, arguing against the possibility of protein or messenger RNA 
transfer from engrafted microglia into nearby cells as an underlying 
mechanism for the beneficial effect of bone marrow transplantation. 

To substantiate the specific role of microglia in bone marrow 
transplantation-mediated disease arrest, we repeated transplantation 
experiments, again at P28, but with the addition of lead shielding to 
block cranial irradiation, which results in repopulation of peripheral 
immunity (Fig. 3b, c) but no parenchymal engraftment (Fig. 3d), sup- 
porting previously published work’*'*. Disease was not arrested in 
‘head-covered’ mice (Fig. 3e), suggesting that peripheral immune 
reconstitution without microglial engraftment is insufficient to arrest 
pathology in Mecp2~”” mice. 

To substantiate further the role of myeloid cells in arrest of Rett 
pathology, we used a genetic approach. We employed the widely 
used Lysm©"* mouse—which results in a high degree of recombina- 
tion in pevene cells, granulocytes and in significant numbers of 
microglia~*—in cross with Mecp2'**? mice. Male progeny, 
Mecp2 bsstp “’Lysm™, express wild-type Mecp2 in myeloid cells on an 
otherwise Mecp2-null background. These animals exhibited improve- 
ments in overall appearance and growth (Fig. 3f, g and Supplemen- 
tary Movie 4) and their lifespans were significantly increased (Fig. 3h). 
The oldest Mecp2'*°?'Lysm animals were 31 weeks of age, with 
survival of 100%; n =6 mice per group. Apnoeas and interbreath 
irregularity of these mice were also significantly reduced compared 
with control mice (Fig. 3i, j), and their open field activity was not 
significantly different from wild-type counterparts (Fig. 3k). These 
results cannot be interpreted by cre leakiness, because no cre-mediated 
recombination was evident in either astrocytes or neurons in Lysm‘* 
crossed to a reporter strain (data not shown), in line with previous 
publications”””’. 

Microglia from Mecp2-null mice were deficient in their response to 
immunological stimuli (Supplementary Fig. 5) and in phagocytic 
capacity, as examined by feeding cultured microglia with pre-labelled 
ultraviolet-irradiated neural progenitor cells, used as apoptotic targets™* 
(Fig. 4a—c). Thus, it is possible that apoptotic debris would accumulate 
over time in the Mecp2-null brain, contributing to neuronal malfunc- 
tion and accelerating disease progression. Along these lines, supple- 
mentation of wild-type microglia could reduce debris levels and allow 
improved neuronal function. Indeed, in mice transplanted with GEP* 


Figure 3 | Brain parenchymal engraftment of cells after bone marrow 
transplantation is required to arrest Rett syndrome. a, Representative 
confocal images of brain parenchyma from cerebellum of wild-type > 
Mecp2 ‘ ” mice 8 weeks after transplantation (at P28), immunolabelled for 
CD11b and GFP (scale bar, 20 ttm). b-e, Mecp2 ” mice underwent bone 
marrow transplantation at P28 with their heads lead-protected. Mice were 
examined at their end-point, about 7 weeks after bone marrow transplantation. 
b, Representative dot plot of splenocytes obtained from bone-marrow- 
transplanted mouse with lead-protected head. c, Distribution of ‘peripheral 
chimaerism’ in mice with lead-protected heads after bone marrow 
transplantation. d, Representative micrograph from mice with lead-protected 
heads after bone marrow transplantation, immunolabelled for GFP. Coronal 
cortical slice is presented showing sporadic cells found in meningeal spaces, but 
not in the parenchyma. e, Lifespan of Mecp2 ”” mice with wild-type bone 
marrow transplantation with lead-covered heads compared with wild-type > 
wild-type controls with lead-covered heads (***P < 0.0001, log rank (Mantel- 
Cox); n = 9 mice per group). f-k, Genetic approach for expressing MECP2 
protein in myeloid cells. Mecp2'°*°? mice were bred to Lysm“” mice and their 
progeny (Mecp2'°**"°?’Lysm“® mice) were analysed for disease progression. 
f, Representative image of mice at P56. g, Weights (mean + s.e.m.) of mice as 
they progress with age. h, Lifespan for indicated groups (***P < 0.0001, log 
rank (Mantel—Cox); n = 6 mice per group). i, Apnoeas (mean + s.e.m.) 
measured by whole-body plethysmography in 30 min for the three groups at 9 
weeks. j, Interbreath irregularity (mean % + s.e.m.) measured at 9 weeks. 

k, Distance travelled (mean + s.e.m.) in an open field at 9 weeks (**P < 0.01; 
* P< 0,001; one-way ANOVA, n = 5 mice per group; Bonferroni post hoc 
test was used for individual comparisons). 
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Figure 4 | Microglial phagocytic activity is necessary for their beneficial 
effect in Mecp2~'” mouse brains. a, Representative micrograph of 
phagocytosing microglia in orthogonal projections of confocal z-stacks. Scale 
bar, 25 jum. b, Wild-type (left) or Mecp2 ”’ (right) microglia incubated for 2 or 
5h with fluorescently labelled ultraviolet-irradiated neural progenitor cells and 
stained with anti-CD11b. Scale bar, 50 um. c, Bar graphs comparing surface 
area of ultraviolet-irradiated neural progenitor cells (NPC) to total surface area 
(mean + s.e.m.) of wild-type or Mecp2 ” microglia (**P < 0.01; 

* P< 0.001; one-way ANOVA; representative experiment shown out of three 
independently performed). d, Representative micrograph of phagocytosing 
microglia in situ containing cleaved caspase-3 debris co-localized with 
lysosomal marker, Lamp-1. Scale bar, 50 tum. e-g, Mecp2™s°P ”’LysmC* mice 
and the appropriate controls were treated with annexin V to inhibit phagocytic 
activity pharmacologically. e, Weights (mean + s.e.m.) of Mecp2*”’Lysm", 
Mecp2?*P' Lysm@", Mecp2*?", Mecp2*Lysm“® mice treated with 
annexin V and Mecp2°**"'Lysm\ mice treated with annexin V are shown at 
the end point for Mecp2'°*°?” and Mecp2’*"°PLysm\” treated with 
annexin V groups (~P63). f, Neurological scores (mean + s.e.m.) at P56 are 
presented (***P < 0.001; one-way ANOVA; n = at least seven mice per group; 
Bonferroni post hoc test was used for individual comparisons). g, Distance 
travelled (mean + s.e.m.) in an open field by mice from all the above groups 
(***P < 0.001; one-way ANOVA, 1 = 5 mice per group; Bonferroni post hoc 
test was used for individual comparisons). 


bone marrow, only GFP* parenchymal cells were consistently found 
containing cleaved caspase-3-positive debris within lysosomes (Fig. 4d). 

It has been previously shown that annexin V (a protein that binds 
phosphatidylserine on apoptotic cells and inhibits engulfment) 
injected intravenously can reach the central nervous system’. 
Moreover, we have recently shown that intravenous injection of 
annexin V results in substantial blockade of phagocytic activity in 
the brain**. Indeed, treatment of wild-type mice with annexin V 
resulted in significant accumulation of terminal deoxynucleotidyl 
transferase (TdT)-mediated dUTP nick end labelling (TUNEL) * frag- 
ments (Supplementary Fig. 6). 

We attempted to inhibit brain phagocyte activity pharmacologically 
in Mecp2'**°?Lysm* mice and compare disease progression with 
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controls. Long-term treatment of Mecp2°*“°??Lysm”* mice with 
annexin V abolished the amelioration of the disease normally seen 
in these mice (Fig. 4e-g). Wild-type mice treated with annexin V 
were not significantly affected. This is probably because, unlike in 
Mecp2’°**°P'Lysm* mice, neurons and astrocytes in wild-type mice 
are fully functional, expressing wild-type MECP2. It is conceivable, 
however, that a longer treatment of wild-type mice with annexin V 
might result in neurological pathology. Overall, these results suggest 
active engagement of wild-type microglia in clearance of apoptotic 
cells or cell remnants within the context of otherwise Mecp2-null 
brain—a task that probably cannot be sufficiently performed by 
Mecp2-null microglia. 

Neuropathologists have observed gliosis and cell loss in the cerebel- 
lum of deceased patients who had Rett syndrome 26 but this work has 
not received much attention, presumably because the disease is generally 
considered non-neurodegenerative. Our results do not claim that neuro- 
degeneration underlies the pathology of the disease. Rather, they suggest 
that Mecp2-null microglia, deficient in phagocytic function, may be 
unable to keep pace in clearing debris left behind from the normal 
process of neural cell death or membrane shedding. This, in turn, would 
lead to a crowded and sub-optimal central nervous system milieu within 
which neurons, already challenged by loss of Mecp2, might be further 
impaired in function. The inability of Mecp2-null microglia to clear 
debris as effectively as wild-type microglia has the potential to contribute 
to the underlying neuropathology and/or the time course of appearance 
of symptoms in Mecp2-null mice*”’. 

Future studies should be aimed at understanding the connections 
between glial phagocytic activity and neuronal function, and possible 
interactions between microglia and astrocytes in the pathology of Rett 
syndrome. Phagocytic activity per se is almost certainly just one aspect 
of glial involvement in the pathophysiology of the disease. It is con- 
ceivable that glia, including astrocytes—which are also capable pha- 
gocytes**—release soluble factors in connection with their own 
phagocytic activity, in turn benefiting neuronal function. Therefore, 
removal of debris itself may not be as primarily relevant to disease 
progression as a secondary response of glia to the phagocytic process. 
Accordingly, inhibition of phagocytosis might result in exacerbation of 
pathology of the disease through these as yet unknown processes, even 
in the absence of deposits of easily observable cellular debris. 

Our present findings support previous publications describing the 
potential for clinical treatment of Rett pathology*”’, while also suggest- 
ing the possibility of achieving this goal through augmentation or 
repopulation of brain phagocytes, or improvement of their phagocytic 
activity. These results open the possibility for a new approach in the 
amelioration of the pathology of Rett syndrome. 


METHODS SUMMARY 

Animals. Male and female C57BI/6-Tg(UBC-GFP)30Scha/J, C57BI/6J, 
B6.129P2(C)Mecp2"™F*/J, B6.129P2-Lyz2°™""""77_ (Lysm™*),  B6.129P2- 
Mecp2'™?84/y, Mecp2'**°P” and C57BL/6J mice were purchased from Jackson 
Laboratories; B6.Cg-Mecp2'™"1#"’M™«4 mice were a gift from A. Pieper and main- 
tained in our laboratory on C57BI/6J background. All procedures complied with 
regulations of the Institutional Animal Care and Use Committee at The University 
of Virginia. 

Irradiation and bone marrow transfer. Four-week-old mice were subjected to 
lethal split-dose y-irradiation (300 rad followed 48 h later by 950 rad). Four hours 
after the second irradiation, mice were injected with 5 X 10° bone marrow cells. After 
irradiation, mice were kept on drinking water fortified with sulphamethoxazole for 
2 weeks to limit infection by opportunistic pathogens. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Animals. Male and female C57BI/6-Tg(UBC-GFP)30Scha/J, C57BI/6J, 
B6.129P2(C)Mecp2'™! B47, B6.129P2-Lyz2'™ Mr) (Lysm"*),  B6.129P2- 
Mecp2'™8"4y, Mecp2!**!°?” and C57BL/6] mice were purchased from Jackson 
Laboratories; B6.Cg-Mecp2'™1*""M™<4 mice were a gift from A. Pieper 
(Southwestern Medical School, Dallas, Texas, USA) and were maintained in our 
laboratory on C57BI/6J background. All animals were housed in temperature and 
humidity controlled rooms, maintained on a 12 h/12 h light/dark cycle (lights on at 
7:00) and age-matched in each experiment. All strains were kept in identical 
housing conditions. All procedures complied with regulations of the 
Institutional Animal Care and Use Committee at The University of Virginia. 
Neurological assays. Mice were weighed on a laboratory scale and weights 
recorded to the nearest tenth of a gram; all other assays were scored on a scale 
from 3 to 1, with ‘3’ being wild type (normal), and ‘1’ being severe, as follows: gait 
‘l’, wide-spread hind limbs, severe ‘waddling gait,’ hopping or (unintentional) 
reverse locomotion; clasping ‘1’, clenching of both rear hindlimbs tightly across 
ventral body aspect; tremor ‘1’, visible involuntary shaking, particularly during or 
after handling; appearance ‘1’, greasy and/or unkempt fur, clear signs of severe 
self-neglect in terms of grooming; kyphosis, hunched posture; eyes sunken, partly 
or fully closed, watery or crusted over. Scores of ‘2’ were assigned to any pheno- 
types in between wild type and severe. 

Bone marrow isolation. Mice were killed using CO, saturated with 70% alcohol. 
Skin was removed from the lower part of the body. Tissue was removed from 
hindlimbs with scissors and dissected away from body. Remaining tissue was 
cleaned from the tibial and femoral bones and bones were separated at the knee 
joint. Bone ends were cut off. Cells were forced out of bones with a stream of pH 7.4 
0.1M PBS containing 10% fetal calf serum using a 10 ml syringe with 25 gauge 
needle. Cells were centrifuged and re-suspended at the concentration of 2 x 10” 
cells per millilitre in PBS (250 pil cell suspension was injected intravenously in each 
animal through the tail vein). 

Irradiation and bone marrow transfer. Four-week-old wild-type C57BI/6J, 
B6.129P2(C) MP BHAT !y, or B6.Cg-Mecp2'™! eM ™<4-ly mice were subjected 
to lethal split-dose y-irradiation (300 rad followed 48 h later by 950 rad). Four hours 
after the second irradiation, mice were injected with 5 x 10° bone-marrow cells 
freshly isolated from C57Bl/6-Tg(UBC-GFP)30Scha/J, CBySmn.CB17-Prkdc“4/J 
or B6.Cg-Mecp2'™#/Mm4—/Y mice, After irradiation, mice were kept on drinking 
water fortified with sulphamethoxazole for 2 weeks to limit infection by oppor- 
tunistic pathogens. 

Fluorescence-activated cell sorting (FACS) of brain isolates. Mice were per- 
fused with 0.1M PBS for 5 min. Heads were removed and skulls were quickly 
stripped of all flesh. Mandibles were next removed, as was all skull material rostral 
to maxillae. Surgical scissors (Fine Science Tools) were used to remove tops of 
skulls, cutting clockwise, beginning and ending inferior to the right post-tympanic 
hook. Brains were immediately placed in ice-cold FACS buffer (pH 7.4 0.1 M PBS, 
1mM EDTA, 1% BSA). Meninges (dura mater, arachnoid mater and pia mater) 
were carefully removed from the interior aspect of skulls and surfaces of brains 
with Dumont #5 forceps (Fine Science Tools). Brains were separated into sections 
(neocortex, hippocampus, striatum, cerebellum, brainstem) and sections from 
each group (n = 3 mice per group) were pooled. Brain tissue was homogenized 
with three gentle strokes in 10 ml FACS buffer in a 15 ml Tenbroeck Homogenizer 
(Lowell) then gently pressed through 70 tm nylon mesh cell strainers with sterile 
plastic plungers (BD Biosciences) to yield a single cell suspension. Cells were then 
centrifuged at 280g at 4°C for 10min, the supernatant removed and cells re- 
suspended in ice-cold FACS buffer. Myelin was removed using AutoMACS and 
myelin removal beads (Miltenyi Biotech). Cells were stained for extracellular 
markers with antibodies to CD11b conjugated to Alexa 780, CD45 conjugated 
to APC, CD4 conjugated to PE and MHC-II conjugated to Alexa 700 
(eBioscience). All cells were fixed in 1% PFA in 0.1 M pH 7.4 PBS. Fluorescence 
data were collected with a CyAn ADP High-Performance Flow Cytometer (Dako) 
then analysed using Flowjo software. To obtain equivalent and accurate cell 
counts, cells were gated first using the LIVE/DEAD Fixable Dead Cell Stain Kit 
as per the manufacturer’s instructions (Invitrogen), pulse width versus area to 
select singlet cells, forward scatter versus side scatter to eliminate debris, then by 
appropriate markers for cell type (for example, CD11b for myeloid-derived cells, 
or CD3 for T cells). Experiments were repeated twice with brains from n = 3 mice 
each group (total of n = 6 mice at 12 weeks after bone marrow transplantation). 
Floating section immunohistochemistry. Free-floating sections were incubated 
with 10% normal serum (either goat or chicken) for 1h at room temperature in 
PBS containing 0.1% Triton X-100 (Sigma), followed by incubation with appro- 
priate dilutions of primary antibodies (anti-CD11b (eBioscience), 1:100; anti- 
MECP2 1:200 (Cell Signaling Technology); anti-cleaved caspase-3 (Cell 
Signaling Technology) 1:300; anti-GFP (Abcam) 1:2,000; anti-GFAP (Abcam) 
1:1,000; anti-IGF-1 (R&D Systems) 1:100; anti-Lamp-1 (Abcam) 1:1000) for 


24-48 h at 4 °C in the same buffer but without serum. Sections were then washed 
for 10 min three times at room temperature in 0.1% Triton X-100 (Sigma) in PBS 
followed by incubation with Alexa-fluor 488, 546, 594, 633 or 647 chicken/goat 
anti-mouse/rat/goat/chicken IgG antibodies (1:1,000, Invitrogen) for 1 h at room 
temperature. After 30 s in 1:20,000 DAPI reagent, sections were washed again with 
0.1% Triton X-100 in 0.1M PBS (3 X 10 min) and mounted with Aqua-Mount 
(Lerner Laboratories) under coverslips. 

Analysis of Nissl staining. Three mice were analysed per group (wild type, 
Mecp2~"”, wild type > wild type, wild type  Mecp2~””). Three coronal brain 
sections containing hippocampus were selected per mouse. Sections were incu- 
bated in 0.1% Triton X-100 (Sigma) in PBS for 10 min at 25 °C. Two washes were 
performed in PBS for 5 min each at 25°C. Sections were then incubated in 1:40 
Neurotrace Fluorescent Nissl Stain 530/615 Red (Molecular Probes), diluted in 
PBS, for 20 min at 25°C. Sections were again incubated in 0.1% Triton X-100 
(Sigma) in PBS for 10 min at 25 °C, and two washes were performed in PBS for 
5 min each at 25 °C. After a final wash, sections were mounted on glass slides using 
Aqua-Mount (Thermo Scientific), coverslipped and visualized by confocal micro- 
scopy. Each hippocampal CA1 region (two per tissue section) was imaged, and 
quantified using the free hand tool in ImageJ. A Wacom computer graphics tablet 
was used to outline the somas of CA1 neurons, and the ‘Measure and Label’ 
function in Image] was used to quantify the area of each soma. After quantifica- 
tion, 100 soma measurements were randomly selected per mouse. This resulted in 
data sets of 300 soma measurements per group. GraphPad Prism was used to 
calculate two-way ANOVAs with Bonferonni post-hoc test. 

Phagocytosis assay and analysis. To determine the phagocytic ability of micro- 
glial cells, we chose apoptotic neural cells as targets, because they would most 
closely approximate natural targets in vivo. Accordingly, dissociated neural pro- 
genitor cells were treated with ultraviolet light for 15 min, stained with 5(6)- 
TAMRA, succinimidyl ester (Invitrogen), and washed thoroughly with cold PBS 
before they were used for incubation with cell culture and fed on to wild-type or 
Mecp2"” microglial cells. After 2 or 5 hours at 37 °C and 5% COs, cells were 
removed, then washed and fixed with 4% PFA O/N. Coverslips were blocked in 
10% chicken serum in 0.1 M PBS containing 0.3% Triton X-100 and 0.5% BSA, 
followed by incubation with anti-CD11b (eBioscience, 1:100) for 1h at room 
temperature. Coverslips were then washed for 10 min three times at room tem- 
perature in 0.1% Triton X-100 (Sigma) in PBS, followed by incubation with Alexa- 
fluor 488 chicken anti-rat IgG antibodies (1:1,000, Invitrogen) for 1h at room 
temperature. Coverslips were washed again with 0.1% Triton X-100 in 0.1 M PBS 
(10 min, three times) and mounted with Aqua-Mount (Lerner Laboratories) on 
slides. Slides were analysed using confocal microscopy and Image] software. 
Whole-animal plethysmography. Plethysmography was performed during the 
dark phase of the diurnal cycle from 9:30 to 12:30 to assess normal waking res- 
piratory parameters. Respiratory parameters were assessed by whole- 
animal plethysmography in unrestrained C57BI/6J (wild-type), Mecp2 ’”, wild- 
type — Mecp2~”, Mecp2-'” > Mecp2~””, wild-type > Mecp2*!~, Lysm*”, 
Mecp2!*P¥ Lysm*"* and Mecp2°**°?/” mice. Animals were placed individually 
into 1,000 cm? Plexiglas chambers (Buxco) and allowed 15 min to acclimate. The 
chamber was continuously flushed with dry room-temperature air (24 + 0.5 °C) 
delivered by three computer-driven mass-flow regulators connected to pure Oo, 
N; and CO, (total flow: 11min‘). The flow signal was recorded and analysed 
using EMKA IOX 2.7 (EMKA Technologies) and Spike 5.21 (CED) software. 
Breathing parameters were calculated from a calibrated flow signal derived from 
a differential pressure sensor connected to the plethysmography chamber using 
the equation of Drorbaugh and Fenn*°. Real-time chamber conditions (temper- 
ature and humidity) and atmospheric pressure were continuously measured and 
the calculation of tidal volume was automatically adjusted to account for changes 
in these variables. Inspiration and expiration were detected using a pressure trans- 
ducer calibrated before each experiment by injecting 1 ml of air. The pressure 
signal was amplified, digitized and recorded using IOX software (EMKA 
Technologies) and Spike2 software (Cambridge Electronic Design), concurrently. 
Animals were left in chambers until sufficient data representing periods of 
quiescent breathing were collected to allow analysis. Animals were then returned 
to home cages. 

Analysis of plethysmography. For apnoea scoring, plethysmography traces were 
hand-scored for apnoeas using the following criteria: expiratory time, Ty > 1s 
measured exclusively during periods free of grooming, sniffing or locomotor 
behaviour, such that false positives were excluded. 

Interbreath irregularity. Interbreath variability was derived from the absolute 
value of ((Tyotayn — Ttotain +1)/Ttotaln +1)> Where Tyotan is the duration of the 
breathing cycle during the nth breath. In each mouse this variable was determined 
over 100 consecutive breaths during periods when the animals were quiescent (no 
locomotion or sniffing) and the average value produced a single score per mouse. 
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This score was averaged for each group of mice and expressed as a percentage 
(+s.e.m.). 

Open-field activity. Behavioural testing in a novel open field was also performed 
during the dark phase of the diurnal cycle from 21:30 to 23:30 in the vivarium, such 
that measured locomotion would most closely correspond to normal waking 
activity. Animals were brought to the testing location within the vivarium and 
allowed to acclimate for 30 min before beginning the assay. Open field behaviour 
was measured using a Versamax activity monitor (AccuScan Instruments). Each 
mouse was placed initially in the centre of the monitoring cage, and allowed to 
roam freely for 20 min before being returned to the home cage. Animal activity 
levels were examined using Versamax software. 

Accelerating rotarod. Motor coordination and balance were assessed on an 
Economex accelerating rotarod (Columbus Instruments) that had the capacity 
to test four mice simultaneously. The testing procedure consisted of two training 
phases and a testing phase: stationary training on a non-rotating rod, constant- 
speed training on a rod rotating at a speed of 2.0 r.p.m. and testing on an 
accelerating rotarod (acceleration = 0.1 r.p.m.). Latency to fall from the rotarod 
was recorded. For stationary training, the mouse was placed on the non-rotating 
rod facing the back side of the apparatus. The mouse was required to remain on the 
rod for 60s before it was removed from the apparatus. If the mouse fell before the 
60s cutoff per trial, the animal was placed back onto the rod. This procedure was 
repeated for two trials with no inter-trial interval.For constant-speed training, the 
rod was adjusted to spin at a constant speed of 2.0 r.p.m. The mouse was placed on 
the rod facing the back side of the apparatus, and was required to remain on the 
rotating rod for 60s before it was removed from the apparatus. If the mouse fell 
before the 60s cutoff per trial, the animal was placed back onto the rod. This 
procedure was repeated for a total of two trials with no inter-trial interval. For 
accelerating rotarod testing, the rod was adjusted to spin at a constant speed of 
2.0 r.p.m. The acceleration (acceleration = 0.1 r.p.m.) was started and the latency 
to fall recorded. 

Microglia primary culture. Mouse mixed glia cultures were prepared from P2 
mouse neonates as follows. Brains were excised and placed in ice-cold HBSS. 
Neocortical tissue was removed, meninges discarded and minced with forceps. 
Minced tissue was then incubated in 2 mg ml * papain (Sigma) in HBSS at 37 °C 
for 30 min. After adding cold heat-inactivated fetal bovine serum, and DNase, the 
tissue was washed twice with cold HBSS by re-suspending the tissue and then 
pelleting in a 4°C centrifuge. To obtain a single-cell suspension, the tissue was 
triturated gently by pipetting through a 5 ml serological pipette 20 times and then 
filtered through a 70m nylon filter (BD Biosciences). Mixed glial cells were 
cultured with culture medium consisting of DMEM/F12 with 10% fetal bovine 
serum (Invitrogen), GlutaMAX (Invitrogen) and 1% penicillin/streptomycin in a 
5% CO3/37 °C incubator, changing medium after 7 days, for a total of 14 days. To 
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obtain microglia, flasks were shaken at 120 r.p.m. for 12h at 37 °C. Supernatants 
were collected, spun at 300g for 7 min, and then washed twice by re-suspending the 
cells in fresh medium and centrifuging at 25 °C. Cell pellets were re-suspended in 
culture medium and seeded at 5 X 10° cells per millilitre onto 24-well inserts 
(5 X 10° cells per well). Microglia were allowed to rest for 48h before MHCII 
and cytokine expression assays. 

Bone-marrow-derived macrophage culture. Macrophage cultures were pre- 
pared from bone marrow as follows: bone marrow was isolated as above. Five 
hundred thousand cells were pipetted into each well of a 24-well plate in 1 ml 
culture medium consisting of DMEM/F12 with 10% fetal bovine serum 
(Invitrogen), 10ngml! rMCSF (eBioscience) and 1% penicillin/streptomycin 
in a 5% CO,/37 °C incubator, changing 500 tl medium after 3 days and 6 days. 
At 9 days, mature macrophages were washed three times with 37 °C 0.1 M PBS to 
remove all non-adherent cells, then placed in 1 ml culture medium consisting 
of DMEM/F12 with 10% fetal bovine serum (Invitrogen) and 1% penicillin/ 
streptomycin. Cells were allowed to rest for 6h before further treatment. 
Enzyme-linked immunosorbent assay of bone-marrow-derived macrophage 
supernatants. Bone-marrow-derived macrophages were prepared as above. Cells 
were treated in triplicate wells with cytokines (50 ng ml? rIL4 or rIFNy), or plain 
medium (control) and placed in a 5% CO,/37 °C incubator for 72h. If cells were 
subsequently treated with lipopolysaccharide (LPS; 100ngml~'), they were 
removed from the incubator at 48 h, LPS was added and cells replaced for the 
final 24h of incubation. At 72 h, 1 ml supernatant was removed from each well and 
frozen immediately at — 80 °C. Samples were thawed simultaneously, an aliquot of 
each diluted 1:5 in manufacturer’s diluent and enzyme-linked immunosorbent 
assay was performed with triplicate samples, according to the manufacturer’s 
instructions (R&D). 

TUNEL. Sections (20 1m) were sliced by cryostat and assayed the same day. 
Sections were permeabilized for 1h using neuropore reagent (Trevigen) then 
labelled using the In situ Cell Death Detection Kit, Fluorescein (Roche) according 
to the manufacturer’s instructions (using floating sections). Sections were washed 
3 X 10 min in 0.1M PBS, incubated for 30s in 1:20,000 DAPI reagent, washed 
1X 10min in 0.1M PBS, mounted on slides with Aqua-Mount (Lerner) and 
coverslipped. Images were taken using a Zeiss LSM 700 confocal microscope. 
Six total slices from the cerebellum of three mice were imaged per group. Five 
images were taken per slice, and a randomly selected 100 im X 100 jm section of 
the granular layer was sampled for each image. The area of staining was measured 
for each field. The staining per field was used to calculate statistics by two-way 
ANOVA with Bonferroni post hoc test using GraphPad Prism. 


30. Drorbaugh, J. E. & Fenn, W. O. A barometric method for measuring ventilation in 
newborn infants. Pediatrics 16, 81-87 (1955). 
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Notch-dependent VEGFR3 upregulation allows 
angiogenesis without VEGF-VEGFR2 signalling 
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Developing tissues and growing tumours produce vascular 
endothelial growth factors (VEGFs), leading to the activation of 
the corresponding receptors in endothelial cells. The resultant 
angiogenic expansion of the local vasculature can promote physio- 
logical and pathological growth processes’. Previous work has 
uncovered that the VEGF and Notch pathways are tightly linked. 
Signalling triggered by VEGF-A (also known as VEGF) has been 
shown to induce expression of the Notch ligand DLL4 in angiogenic 
vessels and, most prominently, in the tip of endothelial sprouts~’. 
DLI4 activates Notch in adjacent cells, which suppresses the 
expression of VEGF receptors and thereby restrains endothelial 
sprouting and proliferation”*°. Here we show, by using inducible 
loss-of-function genetics in combination with inhibitors in vivo, 
that DLL4 protein expression in retinal tip cells is only weakly 
modulated by VEGFR2 signalling. Surprisingly, Notch inhibition 
also had no significant impact on VEGFR2 expression and induced 
deregulated endothelial sprouting and proliferation even in the 
absence of VEGFR2, which is the most important VEGF-A receptor 
and is considered to be indispensable for these processes. By 
contrast, VEGFR3, the main receptor for VEGF-C, was strongly 
modulated by Notch. VEGFR3 kinase-activity inhibitors but not 
ligand-blocking antibodies suppressed the sprouting of endothelial 
cells that had low Notch signalling activity. Our results establish 
that VEGFR2 and VEGFR3 are regulated in a highly differential 
manner by Notch. We propose that successful anti-angiogenic 
targeting of these receptors and their ligands will strongly depend 
on the status of endothelial Notch signalling. 

Angiogenesis is prominently controlled by the activity of Notch in 
the endothelium of developing mice, zebrafish embryos and tumour 
models*’. Although the precise mechanisms linking the Notch and 
VEGF pathways remain incompletely understood, the activation of 
DLL4-Notch by VEGFR2 and the repression of VEGFR2 expression 
downstream of Notch activation are seen as two crucial processes 
regulating endothelial sprouting and angiogenesis'**'°"’. To study 
the role of VEGFR2 in vivo and circumvent the embryonic lethality 
of a global knockout of Vegfr2 (also known as Flk1 and Kdr) in mice, 
we generated inducible endothelial-cell-specific loss-of-function 
mutants (Vegfr2“®°) by combining loxP-flanked (floxed) Vegfr2 
alleles Var ree and tamoxifen-inducible Cre (CreERT2) 
transgenes’*"*. Following tamoxifen administration from postnatal 
day 3 (P3) to P5, analysis of the retinal vasculature at P7 confirmed 
that endothelial sprouting and vessel density were strongly reduced in 
Vegfr2'“"© mutants (Fig. 1a, b). 

Surprisingly, blocking Notch activation (for 48h before analysis) 
with the y-secretase inhibitor DAPT (N-[N-(3,5-difluorophenacetyl)- 
L-alany]]-S-phenylglycine t-butyl ester) strongly enhanced angiogenesis 
even in these mutant animals (Fig. 1a, b and Supplementary Fig. 2a, b). 
Because Notch is not the only substrate of y-secretase and because 


DAPT acts on all cell types, we combined Vegfr2"*/#*4 and 
pil4toe/foxed mice!> to generate endothelial-cell-specific double 
mutants. Deletion of the Dil4 gene between P1 and P3 prominently 
increased sprout formation and vascular density (Fig. 1c), the latter of 
which reflects the known suppression of endothelial cell proliferation 
by Notch**’. Confirming the DAPT data, this increase in angiogenesis 
was not abolished by also targeting Vegfr2, even though both gene 
products were no longer detectable at the protein level (Fig. 1c and 
Supplementary Fig. 2c, d). The outgrowth and density of the Dil4‘4"°/ 
Vegfr2'“F° vasculature were slightly reduced compared with those of 
Di1l4'“*© mutants, whereas the numbers of sprouts and filopodia at the 
angiogenic front (that is, the distal region of the growing vasculature) 
were similar (Fig. le and Supplementary Fig. 2c, f). We also compared 
angiogenesis in the retinas of mice with different combinations of 
floxed Notch1 and Vegfr2 alleles and did not observe any significant 
differences in vascular density or sprouting between Notch1'*?° and 
Notch1'*"“/ Vegfr2'“"° retinas (Supplementary Fig. 2e, f). 

The Rbpj gene encodes the transcription factor RBP-J«, which 
binds to the intracellular domain of activated Notch (NICD) and is 
indispensable for Notch signalling’. The phenotype of inducible 
endothelial-cell-specific Rbpj“"° mutants manifests itself only after 
P4, which enabled us to study the effects of VEGFR2 and VEGF-A 
inhibition in this model from P4 to P7 (Fig. 1d). Like the targeting of 
the Vegfr2 gene, the administration of VEGF-A- or VEGFR2-blocking 
antibodies efficiently reduced endothelial cell sprouting and vessel 
density in the retinas of control mice but did not significantly impair 
the deregulated angiogenic growth in Rbpj'“"© mutants (Fig. 1d, e and 
Supplementary Fig. 3). This finding further supported the idea that 
angiogenesis in Notch-deficient vessels can occur in the absence of 
VEGF-A or VEGFR2 activity. Moreover, despite the high vessel density 
and extensive sprouting in the Rbpj“P° and Dil4'“F“ vasculature, vessel 
outgrowth towards the VEGF-A-rich (avascular) retinal periphery was 
strongly reduced (Fig. 1d, e). Thus, in contrast to the existing models of 
the crosstalk between VEGF and Notch, these results suggest that 
endothelial responses to the tissue-derived VEGF-A gradient are not 
enhanced by the loss of Notch activity. 

Previous studies have reported that VEGF-A-VEGER2 signalling is 
important for the upregulation of Dll4 expression in sprouting 
endothelial tip cells**, which, in turn, is thought to activate Notch 
signalling in adjacent (stalk) cells and thereby suppress sprouting 
behaviour. However, the strong effects of Dll4 or Notch inactivation 
in the Vegfr2'“"° background (Fig. 1a-e) suggested that a substantial 
level of endothelial Notch signalling was maintained without VEGFR2 
function. Indeed, control and Vegfr2'“"© sprouting tip cells and arteries 
displayed similar anti-DLL4 antibody staining (Fig. 2a and Sup- 
plementary Fig. 4), and weaker protein signals were only seen within 
the mutant capillary network. Dil4 and Hoe messenger RNA levels 
were also only partially reduced in Vegfr2'“"° lungs, a tissue that was 
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chosen for analysis because of its particularly high endothelial cell 
content (Fig. 2c). The level of NICD protein was lower in Vegfr2’4"° 
lungs, but NICD was still readily detectable by immunoblotting 
(Fig. 2c). By contrast, VEGFR3, which has previously been shown to 
be positively regulated by VEGFR2 signalling, showed the expected 
decrease in expression in the Vegfr2'“F© endothelium (Fig. 2b, c and 
Supplementary Fig. 4c). These results indicate that substantial DLL4 
expression is maintained in the absence of VEGFR2, while the 
VEGEFR3 protein levels are strongly reduced. 

Active Notch is widely seen as a potent negative regulator of 
VEGFR2 and VEGFR3 expression*”’, but published reports provide 
an inconsistent picture. For example, overexpression of DLL4 or 
NICD in cultured human umbilical vein endothelial cells (HUVECs) 
was shown to repress VEGFR2 transcription’”'*. By contrast, DAPT 
treatment of HUVECs did not lead to significant changes in VEGFR2 
protein or VEGFR2 transcript levels, despite increased proliferation 
and sprouting'””? (Supplementary Fig. 5a). In the mouse retina, both 
Vegfr2 and Vegir3 mRNAs have been found to be only slightly up 
gulated in Dil4*’~ heterozygotes or after DAPT administration”®. In 
zebrafish, vegfr3 (also known as flt4) mRNA expression has been 
shown to be increased in rbpj morphants but not in d/l4 morphants*”. 
Our own data indicate the selective regulation of VEGFR3 but not 
VEGFR2 downstream of Notch. In arteries, in which Notch is more 
active, DLL4 was strongly expressed; however, the endogenous levels 
of the VEGFR3 protein were low. By contrast, VEGFR2 was present in 
arteries and veins (Supplementary Fig. 5b-d). 

Furthermore, we found a strong increase in the VEGFR3 protein but 
not the VEGFR2 protein in the Rbpj‘*" angiogenic front (Fig. 2e). 
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Figure 1 | Notch inhibition promotes 
angiogenesis independently of VEGFR2. 

a, b, Quantification of sprouting defects and 
representative confocal images of the isolectin-B4- 
stained angiogenic front of Vegfr2'“F° mutants, as 
well as the effect of DAPT treatment (see also 
Supplementary Fig. 2a, b). Ctrl, control; R2, 
Vegfr2'“*°. ¢, Induction of endothelial-cell-specific 
DIl4 or Vegfr2 deletion from P1 to P3 and analysis 
at P6. Note the strongly enhanced angiogenesis in 
DIl4*¥° and Dil4 4" Vegfr2'4*° mutants 
compared with control and Vegfr2“"“ retinas. 

d, Isolectin-B4-stained control and Rbpj'“"° P4 
and P7 retinas (tamoxifen administration from P1 
to P3) treated with control IgG, or anti-VEGFR2 or 
anti- VEGF-A antibodies (Abs) from P4 to P7. The 
orange circles facilitate the comparison of vascular 
progression. e, Quantification of vascular 
parameters of retinas in c and d. See 
Supplementary Figs 2 and 3 for additional data. D4, 
DII4AFS, D4/R2, DIl4*ES/ Vegfr2“"°, Rbpj, 
Rbpj'“"°. a, e, Data are presented as mean + s.e.m.; 
n= 12. *, P<0.05 compared with the left-most 
histogram bar unless otherwise indicated; NS, not 
significant. 
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Upregulation of the VEGFR3 protein was also observed in lung lysates 
and sections from Rbpj*"“ and Dil4“"° mutants, whereas the VEGFR2 
level was not significantly altered (Figs 2d and 3c and Supplementary 
Fig. 6). Interestingly, this difference was not mirrored at the mRNA 
level: the levels of Vegfr2 and Vegfr3 mRNA were not substantially 
increased, which hints at post-transcriptional regulation of VEGFR3 
levels by Notch (Fig. 2d). The differential regulation of VEGFR protein 
levels was also mirrored by the activity levels of these receptors. The 
level of phosphorylated VEGFR3 (phospho-VEGFR3) was strongly 
increased in Rbpj'“"© lung lysates, whereas the increase in phopho- 
VEGFR2 was more modest and mirrored the increase in vascular 
endothelial cadherin (CDHS5) (which reflects the number of endothelial 
cells) in the Rbpj“"© mutant model (Figs 2d and 3a-c). VEGFR2 and 
Notch play opposing roles in the regulation of VEGFR3 levels. Their 
actions seem to be independent because the combined deletion of Dil4 
and Ve egfr2 (DIARY Vegfr2' AEC) or the administration of DAPT to 
Vegfr2’”-~ pups led to a significant increase in VEGFR3 protein com- 
pared with control or Vegfr2'“" mutant pups (Fig. 3c, d; see also Fig. 2b). 

To investigate the functional consequences of high levels and activity 
of VEGFR3 protein in endothelial cells with low or compromised 
Notch signalling, we first used a monoclonal antibody that specifically 
blocks ligand binding to VEGFR3 (mF4-31C1). Daily injection from 
P4 to P7 with 50mgkgd ay ~" of mF4-31C1 did not impair vessel 
growth in control or Rbpj'“"~ mutants (Fig. 3e, f), suggesting that the 
ligand-dependent activation of VEGFR3 is not important for develop- 
mental angiogenesis, which is consistent with previous reports*'”’. 
We then carried out short-term administration (24h) of DAPT and 
the kinase inhibitor MAZ51. MAZ51 potently inhibits VEGF-C- or 
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Figure 2 | VEGFR2 strongly regulates VEGFR3 protein levels but not DLL4 
at the angiogenic front. a, b, Whole-mount triple immunofluorescence for 
DLL4 or VEGFR3 (red), VEGFR2 (green) and isolectin B4 (IB4; blue) of P6 
control and Vegfr2'“*“ retinas. The deletion of Vegfr2 for 5 days strongly 
compromised angiogenesis and the expression of VEGFR3 but not of DLL4 in 
tip endothelial cells. The arrows indicate endothelial cells expressing VEGFR2 
(orange) and devoid of VEGFR2 (white). c, d, Immunoblotting (left) and 
quantitative PCR an alysis with reverse transcription (qRT-PCR; right) of 
Vegfr2'“F° and Rbpj**© P6 mouse lungs for the indicated proteins and 
transcripts. The CDH5 protein levels reflect endothelial cell content. Data are 
presented as mean + s.e.m.; n = 4. *, P< 0.001 compared with control group 
data (blue bars) unless otherwise indicated; NS, not significant. P- VEGFR2, 
phospho-VEGER2. e, Whole-mount triple immunofluorescence for VEGFR3 
(red), VEGFR2 (green) and isolectin B4 (blue), showing upregulated VEGFR3 
but not VEGER2 protein in Rbpj'“"“ vessels. a, b, e, Autofluorescent red blood 
cells are visible as round spots inside the vessels. 


VEGF-D-induced activation of VEGFR3 but only weakly impairs 
VEGEFR2 activation by VEGF-A”. In contrast to the VEGFR3-blocking 
antibody, MAZ51 strongly inhibited both VEGF-C-dependent and 
VEGF-C-independent VEGFR3 phosphorylation in cultured cells 
(Supplementary Fig. 7). Although MAZ51 alone only weakly affected 
the number of filopodia and sprouts at the angiogenic front of the 
control retinal vasculature, this inhibitor strongly suppressed the 
enhanced sprouting caused by DAPT (Fig. 4a, b). MAZ51 also reduced 
the phosphorylation of VEGFR3 in Rbpj'“"© lung lysates (Fig. 4c). 
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Figure 3 | Notch regulates VEGFR3 activity independently of VEGFR2. 

a, Controls for VEGFR3 and VEGFR2 phosphotyrosine (pY) 
immunoprecipitation (IP) from P6 lungs. Phospho- VEGER3 is present at 
increased levels in VEGF-C-injected P6 pups. The specificity was confirmed by 
lambda protein phosphatase pre-incubation. IB, immunoblotting antibody; M, 
molecular weight markers. b, Phosphotyrosine immunoprecipitation from 
lung lysates, showing strongly increased phospho-VEGER3 in Rbpj**° 
mutants. Smaller changes were observed for phospho-VEGEFR2, total VEGFR1 
and phospho-VEGFRI. c, Immunoblot analysis, showing strongly increased 
VEGER3 protein levels in Dil4'“®© and Dil4'“®“/Vegfr2“*° P6 lung lysates, 
which was only partly attenuated in the absence of endothelial VEGFR2. 

d, Isolectin B4 combined with VEGFR2 (green) and VEGFR3 (red) staining of 
control and Vegfr2“*© retinas. DAPT administration for 48 h before dissection 
restored high VEGFR3 protein levels at the Vegfr2“F° angiogenic front 
(compare \ with Fig. 2b). e, Confocal images of isolectin-B4-stained P7 control 
and Rbpj'**“ retinas (with tamoxifen administration from P1 to P3) injected 
with control IgG or anti- VEGFR3 antibody from P4 to P7. f, Quantification of 
vascular parameters in control and Rbpj'“"© mice treated with VEGFR3- 
blocking antibody (anti-R3) or IgG, as indicated. Data are presented as 

mean + s.e.m.; 1 = 12. *, P< 0.05 compared with control group data shown to 
the left of each graph unless otherwise indicated; NS, not significant. 


These results point to a crucial role for VEGFR3 in the regulation of 
sprouting in endothelial cells with low Notch signalling activity, which 
is also in agreement with previous reports using morpholino targeting 
of vegfr3 in zebrafish®”’. 

By contrast, the DAPT-induced increase in endothelial cell density 
at the angiogenic front was not significantly altered by the addition of 
MAZ51 (Fig. 4a and Supplementary Fig. 8). Providing further evidence 
for the important role of VEGFR3 in Notch-controlled angiogenesis, 
the injection of Vegfr2'“F° pups with DAPT and MAZ51 (at P5 for 
24h) led to a striking decrease in the number of sprouts and filopodia 
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Figure 4 | Inhibition of VEGFR3 kinase activity suppresses Notch-regulated 
sprouting. a, b, Retina whole-mount immunostaining for isolectin B4 (blue) 
after 24h treatment (P5 to P6) with the indicated inhibitors. MAZ51 efficiently 
blocked sprouting (red dots) of DAPT-treated endothelial cells without having 
an overt effect on vessel density (see also Supplementary Fig. 8). Vehicle, 
dimethylsulphoxide (DMSO). c, MAZ51 had strong effects on VEGFR3 but not 
VEGEFR2 phosphorylation in vivo. Lungs from pups with the indicated 
genotypes and treatments were subjected to phosphotyrosine 


and the vascular density compared with DAPT-treated Vegfr2'4"° 
littermates (Fig. 4d, e). Moreover, administration of AAL-993, a kinase 
inhibitor that blocks all VEGFRs, led to a significant decrease in vascular 
density and endothelial sprouting in DAPT-treated retinas (Supplemen- 
tary Fig. 9). These results indicate that the inhibition of Notch signalling 
allows vessel growth in the absence of VEGF-A-VEGFR2 signalling, 
and this vessel growth requires the kinase activity of VEGFR3 but 
not the binding of VEGF-C. This is consistent with recent reports 
proposing ligand-independent VEGFR3 activation**”. 

The initially simplistic models describing the molecular crosstalk 
between the VEGF and Notch pathways need to be revised. It has 
been reported that endothelial-cell-specific deletion of the Vegfr3 gene 
postnatally in mice leads to enhanced angiogenesis”. The authors of this 
report proposed that VEGFR3 has ‘active’ (ligand-dependent and pro- 
angiogenic) and ‘passive’ (ligand-independent and anti-angiogenic) 
signalling modes, among which the passive mode is mediated by 
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immunoprecipitation followed by VEGFR3 and VEGFR2 immunoblotting. 
The numbers below the blots indicate the relative quantification of the signals. 
d, e, Isolectin-B4-stained control and Vegfr2‘“"° P5 retinas after 24h treatment 
with the indicated chemical inhibitors. Inhibition of the kinase activity of 
VEGER3 strongly impaired angiogenesis in DAPT-treated Vegfr2'“" mutants. 
b, e, Data are presented as mean + s.e.m.; n = 12. *, P< 0.05; A, P= 0.1; NS, 
not significant; compared with control group data shown to the left of each 
graph unless otherwise indicated. 


Notch activation. Our own data indicate that VEGFR3 activity is pro- 
angiogenic in endothelial cells with low or no Notch signalling activity. 
Both findings are not incompatible and might reflect a dynamic, Notch- 
controlled switching of VEGFR3 function in growing blood vessels. 
Our results also fundamentally change the understanding of the 
crosstalk between Notch and VEGFR2 in angiogenesis (Supplemen- 
tary Fig. 1). VEGFR2 is not essential for Dll4 expression in tip cells, 
suggesting that other upstream regulators, such as cell—matrix signal- 
ling, have important roles*®*’’. Although VEGFR2 is not regulated or 
only weakly regulated by Notch, both Notch and VEGFR2 modulate 
VEGFR3 protein and activation levels independently and in opposite 
directions. In settings with low Notch activity, VEGFR3 upregulation 
allows strong, ligand-independent and highly deregulated angiogenesis 
even in the absence of VEGF-VEGFR2 signalling, which mimics 
aspects of growth-factor-independent (autocrine) cancer cell growth”. 
This mode of vessel growth might contribute to the excessive and 
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deregulated angiogenesis that is observed in tumours treated with 
Notch inhibitors’. Probing the status of vascular Notch or VEGFR3 
activation might be relevant for patients who do not respond to 
anti- VEGF treatment, a complicating factor in the therapy of cancer 
and age-related macular degeneration””°. This information could 
yield valuable clues for selecting suitable therapeutic strategies that 
could reduce the resistance to current anti-angiogenic therapies. 


METHODS SUMMARY 


Mutant mice, inducible genetic experiments and pharmacological inhibition. 
For inducible and endothelial-cell-specific genetic loss-of-function experiments, 
we interbred Vegfraioe4 (ref, 12), D1I4"°**4 (ref. 15), Notch 1"*°4 (see Methods) or 
Rbpj'***4 (ref. 16) mice with transgenic mice expressing the tamoxifen-inducible 
recombinase CreERT2 under the control of the endothelial Cdh5 (ref. 14) or Pdgfb 
(ref. 13) promoter. 

Cre activity and gene deletion were induced by intraperitoneal injections of 
50 pg (P1-P3) or 100 pg (P3-P5) tamoxifen (Sigma, T5648; 2 mg ml '). In some 
experiments, Notch signalling was inhibited in half of the pups by subcutaneous 
injection with 100 mgkg”' DAPT (Calbiochem). The kinase activity of VEGFR3 
was inhibited with 10mgkg ’ MAZS51 (Sigma). For simultaneous inhibition of 
the kinase activity of all VEGFRs, we used 25mgkg~' AAL-993 (Enzo Life 
Sciences). To block signalling with monoclonal antibodies, daily intraperitoneal 
injections with 50mgkg * anti-VEGFR2 (ImClone Systems, DC101), anti- 
VEGER3 (ImClone Systems, mF4-31C1), anti- VEGF-A (Genentech, G6-31) or 
control mouse or rat IgG antibodies (Jackson ImmunoResearch) were given at P4, 
P5 and Pé6. 

Ethical review. All animal experiments were performed in compliance with the 
relevant laws and institutional guidelines and were approved by local animal ethics 
committees. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Mutant mice, inducible genetic experiments and pharmacological inhibition. 
To delete Vegfr2 in the postnatal vasculature, we interbred Vegfr2"**4 mice’? with 
transgenic mice expressing the tamoxifen-inducible recombinase CreERT2 under 
the control of the endothelial Cdh5 promoter". Vegfr2" loxed/floxed Cah 5(PAC)- 
CreERT2"'* males were interbred with Vegfr2"**“/"**¢ females to generate litters 
containing Vegfr2iexe™ floxed Cah 5(PAC)-CreERT2"* (Vegfr2' AEC) and control 
(Vegfratoe/Moxed) littermates. Cre activity and gene deletion were induced by 
intraperitoneal injections with 100g tamoxifen (Sigma, T5648; 2mg ml) in 
every pup of the litter, every 24h at P3, P4 and P5 (Fig. 1). Notch signalling was 
inhibited in half of the pups by subcutaneous injection at P5 and P6 with 100 mg 
kg! DAPT (Calbiochem) dissolved in 10% ethanol and 90% peanut oil. Retinas 
were collected at P7, 48h after the first injection of DAPT. Control mice were 
injected with vehicle only. We also induced Vegfr2 deletion from P1 to P3 and 
analysed tissues at P6 (Figs 2 and 3). In addition, Notch signalling was inhibited in 
half of the pups by subcutaneous injection of DAPT at P4 and P5 (Fig. 3d). 

For combined endothelial-cell-specific loss-of-function of Dll4 and Vegfr2, we 
interbred Dil4"°""4 (ref. 15) with Vegfr2" oxed (ref, 12) and Pdgfb-iCre mice'*. Because 
Dll4 is a haploinsufficient gene**', it was necessary to compare the vascular pheno- 
types of animals from two different types of breeding. Specifically, we interbred 
DIj4foxed’+ Veegfi. oxed’* Ddofb-iCre!!* males with DIG oredoxed Vo gfy gfloxed + 
females to generate control (Pdgfb-iCre*!*), Dilla“! Vegfr2*!* Pdefb- 
iCre!’* (pDII4A®°), and Dij4hoxed/ loxed Vegfr2ioxedfoxed pdofh-iCre!!* (DIS 
Vegfr24®°) littermates. To generate Dil4*/* Vegfr2"*""**4pdofh-iCre’!* 
(Vegfr2'“"° in Fig. 1c) and matching littermate control animals (Pdgfb-iCre'’*), 
we interbred Dil4"**“’* Vegfr2"**""* Pdefb-iCre’* males with Dil4*/* 
Vegfr2iexedifoxed females in the same genetic background. For the combined 
endothelial-cell-specific loss-of-function of Notch1 and Vegfr2, we interbred 
Notch1"**4 mice®?, Vegframexed and Cdh5(PAC)-CreERT2 mice to obtain the 
animals with the genotypes described in Supplementary Fig. 2e. To induce Cre 
recombinase activity and gene deletion, animals were injected with 50g 
tamoxifen (1 mg ml’), every 24h, at P1, P2 and P3 and dissected at P6. 

To disrupt transcriptional responses downstream of activated Notch in 
endothelial cells, we interbred Rbpj"°**4 mice!® with the Pdgfb-iCre transgenics. 
Rbpj °°" Pdofb-iCre'!* males were interbred with Rbpj'°**/"*** females to 
generate litters containing Rbpj'*“/"*¢Pdgfb-iCre"!* (Rbpj*®°) and control 
(Rbpj'*""*"4) Jittermates. All pups were injected with 50g tamoxifen 
(1 mg ml”), every 24h, from P1 to P3 and dissected at P6 (Fig. 2) or P7 (Figs 1 
and 3). 

For the combined pharmacological inhibition of Notch and VEGFR signalling 
in vivo, we subcutaneously injected the indicated combinations of vehicle 
(dimethylsulphoxide (DMSO)), the Notch signalling inhibitor DAPT 
(Calbiochem; 100mgkg~'), the VEGFR3 inhibitor? (MAZ51, Sigma; 10mg 
kg~') or the pan-VEGFR tyrosine kinase inhibitor (AAL-993, Enzo Life 
Sciences; 25 mg kg ')*>*4, Injections of the vehicle and inhibitors DAPT and 
MAZ51 were done twice, at P5 and 8h before dissecting the animals at P6: that 
is, for a 24-h period (16h + 8h) in total. Injection of the vehicle and inhibitors 
DAPT and AAL-993 was done at P5, 24h before collecting the retinas at P6. When 
Vegfr2 deletion was combined with chemical inhibition (Fig. 4), all of the control 
and Vegfr2'“P“ litter pups were injected with 50 tg tamoxifen (1 mg ml!) from P1 
to P3 before receiving the inhibitor injections at P4 as described above. 

For the combined deletion of Rbpj and inhibition of VEGFR signalling in vivo, 

we injected 50 ug tamoxifen (Imgml') from P1 to P3 and then injected the 
monoclonal antibodies anti- VEGFR2 (ImClone Systems, DC101)*°, anti- VEGFR3 
(ImClone Systems, mF4-31C1)”, anti- VEGF-A (Genentech, G6-31)°* or control 
mouse or rat IgG antibodies (Jackson ImmunoResearch) at 50 mg kg day’ from 
P4 to Po. 
Immunohistochemistry. All of the immunostaining was carried out with littermate 
tissues processed simultaneously under the same conditions. With the exception of 
the isolectin B4 staining of the retinas from Dil4/Vegfr2"**/"*** litters (see 
explanation in the previous section), no comparisons have been made between 
animals from different litters or processed on different days. 

To analyse (and quantify) the retina vascular phenotype and to preserve intact 
endothelial filopodia and sprouts at the angiogenic front, whole animal eyes were 
fixed in 4% paraformaldehyde (PFA) at 4 °C overnight. The following day, the eyes 
were washed in PBS before the retinas were dissected and partially cut in four 
quadrants to allow subsequent flat mounting. After blocking/permeabilization in 
1% BSA with 0.3% Triton for some hours at room temperature or overnight at 4°C, 
the retinas were washed two times in Pblec buffer (1% Triton X-100, 1 mM CaCh, 
1mM MgCl, and 1 mM MnCl, in PBS, pH 6.8) for 20 min and then incubated for 
2h in Pblec buffer containing biotinylated isolectin B4 (Vector Labs, 1:50). 
Following five washes (each 20 min) in blocking solution, retinas were incubated 
with Alexa-Fluor-streptavidin-conjugated antibodies (Molecular Probes, 1:100) 


for 2h, washed three times further and flat-mounted on microscope glass slides 
with Fluoromount-G (SouthernBiotech, 0100-01). Double or triple whole-mount 
immunohistochemistry was performed in retinas fixed for 2 h on ice in 2% PFA or 
at —20 °C in methanol. After fixation, retinas were blocked for 1 h in 1% BSA with 
0.05% Tween and incubated overnight or for 2 h with isolectin B4 (1:50) and the 
following primary antibodies: rat anti- VEGFR2 (1:200, 555307, BD Pharmingen), 
goat anti-VEGFR3 (1:200, AF743, R&D Systems) or goat anti-DLL4 (1:200, 
AF1389, R&D Systems) antibody. For detection, suitable species-specific Alexa- 
Fluor-coupled secondary antibodies (1:500) were used. To detect proteins in P6 
mouse lungs, organs were fixed in 4% PFA. Immunohistochemistry was per- 
formed on 100-um vibratome sections as described above for the retinas. The 
following additional antibodies were used: rat anti-endomucin (from D. 
Vestweber) and rabbit anti-GFP—Alexa Fluor 488 (1:500, A21311, Invitrogen). 
Immunoprecipitation, immunoblotting and qRT-PCR of lung lysates. For 
immunoprecipitations, freshly dissected or snap frozen lungs were lysed in lysis 
buffer (20 mM Tris, 1 mM EDTA, 1 mM dithiothreitol (DTT), 1% NP-40, 0.1% 
SDS, 0.1% deoxycholate, 150 mM NaCl, 1 mM PMSF, 1 mM Na3VO,, 5mM NaF, 
phosphatase inhibitor cocktail SetV (Calbiochem, 524629) and protease inhibitor 
cocktail (Sigma, P2714)). Lysates were centrifuged at 4°C for 30 min at 20,000g, 
and aliquots were set aside for direct input blot analysis. For the immunoprecipi- 
tation, the remaining lysates were pre-cleared for 1h at 4°C with protein-G- 
sepharose or protein-A-sepharose beads. Pre-cleared lysates were incubated with 
primary antibody for 2h at 4°C, after which the beads were added and incubated 
for an additional 2 h at 4 °C. Immunocomplexes were washed five times with lysis 
buffer (without SDS and deoxycholate) and analysed by SDS-PAGE. For the 
detection of phosphotyrosine after VEGFR1 immunoprecipitation, membranes 
were blocked with BSA, and 1 mM Na3VO, was added during the blocking and 
washing steps. The following primary antibodies were used for immunoprecipita- 
tions: control mouse or goat IgG, mouse anti-phosphotyrosine (4G10, 05-321, 
Millipore) and goat anti- VEGFR1 (AF471, R&D Systems). Subsequent detection 
of the immunoprecipitated protein was done with the antibodies described below. 
As a positive control for the detection of phospho-VEGFR3 after phospho- 
tyrosine immunoprecipitation, P5 pups received an intraperitoneal injection of 
200ngg * body weight VEGF-C (Peprotech) or vehicle (PBS) at 30min 
before tissue dissection. As a negative control to confirm the specificity of the 
phospho-tyrosine immunoprecipitation, a sample was treated with lambda protein 
phosphatase (New England Biolabs) before incubation of the pre-cleared 
lysate with anti-phosphotyrosine antibodies. To determine the ligand require- 
ment for VEGFR3 phosphorylation after deletion of Rbpj in endothelial cells, we 
intraperitoneally injected 25mgkg~' mouse VEGFR3-Fc (743-R3-100, R&D 
Systems) at P5 and collected the tissue 24 h later. 

For the analysis of protein levels in mutant mouse pups, the inferior left lobe or 
the whole lung was dissected out, transferred to a reagent tube and frozen in liquid 
nitrogen. On the day of the immunoblot analysis, the tissue was lysed in PL buffer 
(20mM Tris-HCl, pH8.0, 1mM EDTA, 1mM DTT, phosphatase inhibitor 
cocktail SetV, 1mM Na3VOy,, protease inhibitor cocktail (1:10), 1% Triton 
X-100 and 150mM NaCl) and homogenized with a cylindrical glass pestle 
(Potter-Elvehjem). Tissue debris was removed by centrifugation, and the 
supernatant was diluted in loading buffer and analysed by SDS-PAGE and immu- 
noblotting. The following antibodies were used for immunoblotting: rabbit anti- 
VEGER2 (2479, Cell Signaling Technology), rabbit anti-phospho- VEGFR2 (2478, 
Cell Signaling Technology), rat anti- VEGFR3 (14-5988, eBioscience), rabbit anti- 
VEGFRI (sc-316, Santa Cruz Biotechnology), rabbit anti-cleaved-Notch1 (2421, 
Cell Signaling Technology), rat anti-CDH5 (555289, BD Biosciences), mouse 
anti-phosphotyrosine (4G10, 05-321, Millipore) and rabbit anti-tubulin (T5168, 
Sigma). 

All of the immunoblot images shown represent the average result obtained from 
a minimum of two different lung extracts for each group, and each extract was 
blotted in duplicate. The blots shown in Fig. 4c were quantified with the gel 
analysis function in the program Image]. 

For the analysis of mRNA relative expression levels, total RNA was isolated with 
the RNeasy Mini Kit (QIAGEN), and 500 ng per reaction was used to generate 
cDNA with the SuperScript III First-Strand Synthesis System (Invitrogen) and 
oligo(dT) primers. Quantitative PCR with reverse transcription (qRT-PCR) was 
performed by using an ABI PRISM 7900HT Sequence Detection System. TaqMan 
gene expression assays for murine Gapdh, Dll4, Heyl, Vegfr2, Vegfr3, Pecam1, 
Cdh5, Pdgfb and Vegfa were used in combination with TaqMan Gene 
Expression Master Mix (Applied Biosystems). Gene expression was normalized 
to the endogenous control Gapdh. The relative expression differences obtained 
represent the average of the results obtained for two independent animals per 
group from two independent litters. For each animal/cDNA, two separate qRT- 
PCR plates were used, with duplicate reactions for each gene. 
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A fraction of Vegfr2'“P° Cdh5(PAC)-CreERT2 mutant lungs showed reduced 
deletion of Vegfr2 in the lungs and were therefore excluded from the analysis. The 
results shown represent mutants for which the deletion of Vegfr2 was most efficient 
(around 90%, Fig. 2c). 

Quantitative analysis of the retinal vasculature. All of the images shown are 
representative of the vascular phenotype observed in at least eight retinas from 
four pups from two distinct litters per group. All quantifications were done with 
Volocity (Improvision) software on high-resolution confocal images representing 
a thin z section of the retina stained with isolectin B4. The numbers of endothelial 
sprouts and filopodial extensions were quantified at the retina angiogenic front in 
a minimum of 12 fields (sized 1,550 um X 1,550 tm for sprouts and 387 jtm X 387 
uum for filopodia) and four retinas per group. The total number of sprouts or 
filopodia was normalized for a standard endothelial vessel length of 1,000 jum that 
was measured and defined according to published protocols**. The proportion of 
endothelial cell area coverage was calculated in a minimum of eight fields per 
group. Regions of quantification were selected in 1,550 im X 1,550 tm fields, 
including the whole vascular plexus, for genetic deletions starting at P1, or in only 
the angiogenic front (around 1,550 jm X 250 um) for 24-72 h experiments. The 
isolectin-B4-stained endothelial network was selected based on the strong intensity 
of the fluorescent signal. For each vascularized field, the ratio of the isolectin-B4- 
positive area to the total area was calculated and defined as the percentage of the 
vascularized retina area covered by endothelial cells. Vascular progression was 
measured by defining a straight line from the angiogenic front to the centre of 
the retina for each retina quadrant in low magnification stereomicroscope pictures. 
A minimum of 16 quadrants belonging to four retinas per group was used for 
quantification. The P values in all figures were calculated using a two-tailed 
Student’s t-test. 

Image acquisition and processing. Stained and flat-mounted retinas were 
analysed at high resolution with a TCS SP5 confocal microscope (Leica) or at 
lower resolution with an MZ16F stereomicroscope (Leica) coupled to a digital 
camera (Hamamatsu C4742-95). Volocity, Photoshop (Adobe) and Illustrator 
(Adobe) software were used for image acquisition and processing, in compliance 
with Nature’s guide for digital images. All of the images shown in which the 
immunostaining levels were compared are representative of at least six different 
images from three different retina immunostainings per group, where the laser 
excitation and confocal scanner detection settings were the same between groups. 
Cell culture and in vitro assays. HUVECs and immortalized mouse endothelial 
cells (MECs) were isolated and cultured as described previously’. Cells (5x 10°) 
were plated in each well of six-well plates coated with PBS containing 5 pg ml 
fibronectin. To some wells, 1.44 ug ml ~ | recombinant DLL4-His (1389-D4, R&D 
Systems) was added to the coating solution. DMSO or an equal volume of DAPT 
(10 1M) dissolved in DMSO was added to the culture medium in some wells. Cells 
were incubated overnight and starved for 5h before stimulation with vehicle or 
50ngml ' VEGF-A (Peprotech) for 8 min. Cells were lysed in 200 il 1x SDS 
sample buffer and subjected to immunoblotting with rabbit anti-VEGFR2 (2479, 
Cell Signaling Technology), rabbit anti-phospho- VEGFR2 (478, Cell Signaling 
Technology), rat anti- VEGFR3 (14-5988, eBioscience), rabbit anti-cleaved- 
Notchl (2421, Cell Signaling Technology), rat anti-CDH5 (555289, BD 
Biosciences) or rabbit anti-tubulin (T5168, Sigma) antibody. 

To generate human embryonic kidney 293 (HEK293) cells stably expressing 
mouse VEGFR3 (R3) or a tyrosine-kinase dead version (R3'), we first sub- 
cloned in-frame the mouse Vegfr3 cDNA without the 3’ untranslated region 
(UTR) into the vector pcDNA3.1 membrane-Tomato-2A. A single point mutation 
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(Ile1053Phe) in VEGFR3 was generated following the procedures described in the 
QuikChange II Site-Directed Mutagenesis Kit (Stratagene). To generate HEK293 
cells stably expressing mouse VEGFR2 (R2), we first subcloned in-frame the mouse 
Vegfr2 cDNA without the 3’ UTR and stop codon in the vector pcDNA3.1-2A- 
MYFP. HEK293 cells were transfected with one of the three constructs by using 
Lipofectamine 2000 (Invitrogen) and selected in 500 pg ml | G418 (neomycin) for 
2 weeks. Following selection, isolated red or yellow fluorescent clones were picked 
and expanded. Immunoblot analysis was subsequently performed to confirm the 
expression of VEGFR3 or VEGFR2. For stimulation assays, 5 X 10° cells were 
plated in each well of six-well plates, cultured for 1 day to attach and then, on 
the following day, starved overnight (18h). Starved cells were stimulated with 
500 ng ml! VEGE-C, VEGF-A (Peprotech) or vehicle, in the absence or presence 
of the indicated concentrations of anti- VEGFR3 (mF4-31C1) or MAZ51. Cells 
were lysed in 1 X lysis buffer (9803, Cell Signaling Technology), and after removing 
the residual cell debris, the extracts were used for immunoblotting as described 
previously or for enzyme-linked immunosorbent assays (ELISAs). ELISA plates (96 
wells) were coated overnight in capture antibody solution (PBS containing 
11g ml goat-anti mouse VEGFR3 (AF743, R&D Systems) or PBS containing 
lug ml! goat-anti mouse VEGFR2 (AF644, R&D Systems)). The next day, the 
solution was removed, and blocking solution (PBS containing 0.5% BSA and 0.05% 
Tween 20) was added for 1 h. After three washes with wash buffer (PBS containing 
0.05% Tween 20), 100 jl lysate was added in triplicate to the coated plates, which 
were then incubated for 2 h. The plates were washed four times in wash buffer, and 
primary antibodies diluted in blocking buffer were added for 2 h. For the detection 
of the captured phosphorylated VEGFR3 or VEGEFR2 proteins, biotinylated mouse 
anti-phosphotyrosine (16-103, Millipore) was used. For the detection of all 
captured VEGFR3 and VEGFR2, we used rat anti-VEGFR3 (14-5988, 
eBioscience) and rat anti- VEGFR2 (2479, Cell Signaling Technology), respectively. 
Following four additional washing steps, species-specific or streptavidin horse- 
radish-peroxidase-conjugated secondary antibodies were diluted in blocking solu- 
tion and then incubated with the plates for 30 min. The washing procedure was 
repeated and TMB substrate (7004S, Cell Signaling Technology) was used to obtain 
the detection signal. Absorbance measurements at 450 nm and 650nm and the 
relative quantification were performed on a Synergy 2 Biotek microplate reader. 
The ELISAs were performed in triplicate and for three times per condition. The 
ratio of the detected phosphorylated VEGFR3 to the total VEGFR3 (and the 
VEGER2 ratio) was determined for each condition, and the relative difference from 
the non-treated condition was calculated. The P values were calculated using a two- 
tailed Student's t-test. 
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During protein synthesis, the ribosome accurately selects transfer 
RNAs (tRNAs) in accordance with the messenger RNA (mRNA) 
triplet in the decoding centre. tRNA selection is initiated by 
elongation factor Tu, which delivers tRNA to the aminoacyl 
tRNA-binding site (A site) and hydrolyses GTP upon establishing 
codon-anticodon interactions in the decoding centre’°. At the 
following proofreading step the ribosome re-examines the tRNA 
and rejects it if it does not match the A codon”*'°™. It was sug- 
gested that universally conserved G530, A1492 and A1493 of 16S 
ribosomal RNA, critical for tRNA binding in the A site’*"””, actively 
monitor cognate tRNA", and that recognition of the correct 
codon-anticodon duplex induces an overall ribosome conforma- 
tional change (domain closure)’’. Here we propose an integrated 
mechanism for decoding based on six X-ray structures of the 70S 
ribosome determined at 3.1-3.4 A resolution, modelling cognate 
or near-cognate states of the decoding centre at the proofreading 
step. We show that the 30S subunit undergoes an identical domain 
closure upon binding of either cognate or near-cognate tRNA. This 
conformational change of the 30S subunit forms a decoding centre 
that constrains the mRNA in such a way that the first two nucleo- 
tides of the A codon are limited to form Watson-Crick base pairs. 
When UeG and GeU mismatches, generally considered to form 
wobble base pairs, are at the first or second codon-anticodon posi- 
tion, the decoding centre forces this pair to adopt the geometry 
close to that of a canonical CeG pair. This by itself, or with distor- 
tions in the codon-anticodon mini-helix and the anticodon loop, 
causes the near-cognate tRNA to dissociate from the ribosome. 
We determined six X-ray structures of the 70S ribosome at 3.1-3.4 A 
resolution (Supplementary Tables 1 and 2) programmed by 30- 
nucleotide-long mRNAs with the AUG codon and tRNA™* in the 
peptidyl tRNA-binding site (P site) and the A site occupied by 
tRNA," or tRNA (Fig. la and Methods). In one set of experiments, 
tRNA,'*" and tRNA’ were bound to their respective cognate codons 
CUC and UAC in the A site. In a second set of experiments, we 
modelled near-cognate states of the ribosome (Supplementary Fig. 1). 
These states of the ribosome naturally occur during protein synthesis 
but with low probability because binding of cognate tRNA is kinetically 


Figure 1 | Codon-anticodon interactions in the decoding centre on the 70S 
ribosome. a, The mRNA path on the 70S ribosome with the decoding (DC) area 
indicated. b, Close-up view of the mRNA P/A kink with near-cognate tRNA,/. 
Magnesium ions are in green. c, d, The first base pairs of the near-cognate (c) and 
cognate (d) codon-anticodon duplexes and their interactions with A1493 of 16S 
rRNA. e, f, The second base pairs of the near-cognate (e) and cognate (f) codon- 
anticodon duplexes and their interactions with G530 and A1492 of 16S rRNA. 
g, h, A classical wobble UsG pair (g) versus canonical CeG interactions (h); a 
magnesium ion interacting with the base pair is coordinated by protein $12 and 
part of 16S rRNA. All graphical representations were prepared with PyMol. 
Probable hydrogen bonds within 3 A distance are indicated by dashed lines; 
2F, — F, electron density maps are contoured at 1.2 sigma. 
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favoured. We made the ribosome accept near-cognate tRNA by giving it 
only one type of tRNA carrying a mismatch to the A codon along with 
tRNA™*' for the P site. In these complexes the A site was filled either by 
tRNA,'*" and codon UUU with a UsG mismatch in the first position of 
the codon-anticodon helix or by tRNA’ and codon UGC with a GeU 
mismatch in the second position. 

As shown earlier, the mRNA forms a kink between the P and A 
codons (P/A kink), a universal feature of the mRNA path on the 70S 
ribosome that is stabilized by the P site tRNA, 16S ribosomal RNA 
(rRNA) and magnesium ions””°”’ (Fig. 1b and Supplementary Fig. 2). 
The single mismatch states described above represent bona fide near- 
cognate complexes expected to have standard wobble UeG and GeU 
base pairs. At our data resolution (3.1-3.5 A) we can confidently assign 
the general base pairing (Supplementary Figs 3 and 4). The electron 
density maps unambiguously demonstrate that U4 and G5 of the A 
codons UUU and UGC do not show the anticipated wobble inter- 
actions with G36 in tRNA,'" and U35 in tRNA™", respectively. 
Instead, U4*G36 and G5eU35 at the first and second positions of 
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the codon-anticodon duplexes form base pairs similar to a standard 
Watson-Crick GeC pair (Fig. 1c, e and Supplementary Fig. 5). GeC- 
like GeU or GeT pairs have been shown earlier for RNA and DNA in 
other X-ray structures’. When UsG is at the third codon-anticodon 
position we observe standard UeG wobble pairing (Fig. 1g, h). 
Unexpectedly, nucleotides A1493, A1492 and G530 of 16S rRNA in 
helix 44 (h44), which contact the first and the second pairs of the 
codon-anticodon helix, interact with these unusual U4*°G36 and 
G5eU35 pairs identically to the way they interact with canonical 
Watson-Crick base pairs C4°G36 and A5¢U35 (Fig. 1c-f). These find- 
ings are in contradiction with studies where these nucleotides were 
given a role as monitors and discriminators of canonical Watson- 
Crick pairs in the decoding process'*'®. Our structures show that 
G530, Al492 and A1493 form a static part of the decoding centre, 
defining its spatial and stereochemical properties (Fig. 2a, b). 

The observed non-wobble UeG differs from previous X-ray studies 
that were based on the 30S subunit alone”’ (Fig. 2c and Supplementary 
Figs 5 and 6). For example, for the study of the mismatch at the first 


b Cognate versus second mismatch 


Anticodon 


) Cognate 


oa] Near-cognate 


Cognate / tRNA’ 
Near-cognate / tRNA‘ 


Figure 2 The nature of the decoding centre. a, b, The overall conformations 
of universally conserved G530, A1492 and A1493 of 16S rRNA in the cognate 
structures are identical to those in the near-cognate models when the 
mismatches are at the first (a) or second (b) codon—anticodon positions. 

c, Differences between the position of the first uridine in the UUU codon base- 
paired to the GAG anticodon of tRNA2'“" from our 70S structure and from the 
30S model'”. d, Comparison of the anticodon loops of tRNA™ in the cognate 
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(red) and near-cognate (cyan) states. e, Rearrangements of rRNA helices h44 
and H69 in the near-cognate state upon binding of the aminoglycoside 
paromomycin (PAR). The near-cognate structures with tRNA°™ are shown (a 
similar effect of PAR is observed with tRNA,'*", see also Supplementary Fig. 
10). f, Magnified view of the changes in the A1493 phosphate position induced 
by PAR. Superimpositions in d, e and f were performed using 23S rRNA as 
reference. 
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position, 30S crystals soaked with an anticodon stem-loop of tRNA)" 
and a hexauridine (Us) mRNA displayed a classical wobble UeG pair. 
There the P codon was mimicked by the 3’-end of 16S rRNA, so the Ug 
mRNA could only bind to the A site and downstream, leading to a 
situation where the mRNA was not covalently linked between the P 
and A codons. By superimposing the A site GAG anticodons from our 
near-cognate tRNA,'“" structure and the 30S model we found that the 
first nucleotide of the A codon is positioned differently (Fig. 2c and 
Supplementary Fig. 6). Because it does not have the natural restraint 
coming from being covalently bound to the P codon, this first nucleo- 
tide in the A codon has the freedom to move so it can form a wobble 
UeG pair. However, in our structure the P/A mRNA kink specifically 
directs the first nucleotide of the A codon to form Watson-Crick like 
interactions with G36 of tRNA,/" (Fig. 1c). 

The positional restrictions imposed by the decoding centre on the 
first two near-cognate UeG codon-anticodon pairs may result in dif- 
ferences in geometry compared with the corresponding cognate 
helices (Supplementary Figs 7 and 8). Although, the resolution of 
the data sets does not allow us to determine the exact value for these 
deviations in the base pairs, the tendencies are clear. A noticeable 
change is an increase in buckling at the first and third codon-anticodon 
positions with the U4*G36 mismatch (Supplementary Fig. 7) and at the 
second and third positions with the G5*U35 mismatch (Supplemen- 
tary Fig. 8). These deviations could disturb the base stacking network 
within the near-cognate codon-anticodon helices and deform the 
entire anticodon loop structure™ (Fig. 2d, Supplementary Fig. 9 and 
Supplementary Movie 1). This deformation might influence the 
position of helix 69 (H69) of 23S rRNA, whose universally conserved 
A1913 protrudes into the decoding centre*””' (Fig. 2e). 

Additional structures of the near-cognate states determined in the 
presence of the miscoding aminoglycoside paromomycin (Supplemen- 
tary Tables 1 and 2) reveal a movement of H69 accompanied by 
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Figure 3 | The principle of decoding. a, Superposition of 23S rRNA from 
near-cognate and cognate structures with tRNA," shows identical domain 
closure in the near-cognate and cognate states. b, Representation of 
conformations of the core nucleotides composing the decoding centre 
depending on its functional states (data with the vacant 70S ribosome are 
unpublished). ¢, d, Illustration of the decoding principle: together with the 
constraints imposed on the A codon by the P/A kink coordinated by a 
magnesium ion (green sphere), the DC (h18, h44 and protein $12 from the 
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rearrangements of the intersubunit bridge B2a between h44 and H69 
(Fig. 2e and Supplementary Movie 2). These distortions are most 
probably caused by binding of the antibiotic that strongly shifts the 
A1493 phosphate group (Fig. 2f). Although this shift does not alter 
much the interactions of A1493 with U4*G36 and A1492/G530 with 
G5eU35 (Supplementary Figs 7g and 8h), these local changes modulate 
the B2a bridge. H69 is displaced towards the tRNA, which probably 
enhances the interaction surface of H69 with the tRNA D-stem. In the 
presence of paromomycin the position of H69 is closer to that observed 
for the cognate state (Supplementary Fig. 10). Furthermore, displace- 
ment of the A1493 phosphate group relaxes the decoding pocket from 
the side of the A codon (Fig. 2e) and changes the deformation of the 
near-cognate codon-anticodon helix (Supplementary Movie 2). This 
novel understanding of the paromomycin action therefore differs from 
the previously suggested mechanism in decoding where it was pro- 
posed to influence the monitoring capabilities of A1492 and A1493 
(ref. 18). The observed moderate structural rearrangements with 
paromomycin are consistent with its measured effect at the proofread- 
ing step”. 

We find that both near-cognate tRNAs induce rearrangements of 
the 30S subunit known as domain closure’” (that is, shoulder move- 
ment and head rotation) as described for cognate tRNA” (Fig. 3a). 
This implies that domain closure is an inherent quality of the ribosome 
in response to binding of any tRNA to the A site’? and is prerequisite 
for formation of the decoding pocket. 

Initially, the mechanism underlying the decoding process was 
deduced from pioneer X-ray structures of the isolated 30S subunit 
where crystals were soaked with U, and anticodon stem-loop mimick- 
ing mRNA and tRNA”. Besides those limitations, all attempts to 
model near-cognate states on the 30S subunit were performed in the 
presence of paromomycin, which, on the one hand, stimulated an 
ordered binding of the near-cognate tRNA analogs, but, on the other, 
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small subunit and H69 from the large subunit) restricts the allowed geometry of 
the first two nucleotides of the codon. No such restraints are imposed on the 
third base pair. A near-cognate tRNA with GeU in the first or second position is 
forced to form Watson-Crick-like base pairs (middle panels). This creates 
repulsion or requires energy for tautomerization (shown in pink), which by 
itself can be the source of the tRNA discrimination. The right panels illustrate 
the impossible situation when standard wobble base pairs (shown in red) occur 
in either the first or second positions of the codon-anticodon duplexes. 
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made it difficult to distinguish the independent effect of the mismatch 
on the 30S decoding centre’’. The cognate and near-cognate models of 
the 70S ribosome described here, with or without paromomycin, 
together with our previous structures'*”', give rise to novel insights 
into decoding with revisited roles for universally conserved nucleotides 
G530, A1492 and A1493 of 16S rRNA (Fig. 3b). Although these 
nucleotides form extensive contacts to the A-minor groove of a 
codon-anticodon helix, G530, A1492 and A1493 do not actively sense 
the correct Watson-Crick base-pairing geometry and thus do not 
discriminate against near-cognate tRNA. 

We propose that upon binding of cognate or near-cognate tRNA to 
the 70S ribosome, the small subunit undergoes domain closure around 
the anticodon loop of the tRNA. The closure results in formation of a 
tight decoding centre that restricts the first two nucleotides of the A 
codon to form exclusively Watson—Crick base pairs with the tRNA 
anticodon (Fig. 3c, d). Owing to our current data resolution, we cannot 
precisely identify the hydrogen bond pattern of the mismatches in the 
near-cognate codon-anticodon helices, but tautomerism is a plausible 
chemical mechanism. An alternative explanation for the tRNA dis- 
crimination source could be repulsion in the UeG pair (Fig. Ic, e). 
Energy expenditure for formation of tautomers (or repulsion energy) 
could constitute the sole cause for the very efficient rejection of near- 
cognate tRNAs by the ribosome**. Additionally, the observed 
deformation of the anticodon may lead to alterations in the codon- 
anticodon mini-helix and propagate through the rest of the near- 
cognate tRNA molecule, destabilizing it and causing the tRNA to 
dissociate from the ribosome (Supplementary Fig. 11). This corrobo- 
rates the idea that evolutionarily tuned sequences of tRNAs play an 
active role in the tRNA selection process***. Recent X-ray structures 
of the 70S ribosome with cognate tRNA and elongation factor Tu’”* 
demonstrated that binding of an anticodon loop of tRNA in the decod- 
ing centre is nearly identical to that shown for accommodated tRNA. 
This information prompts us to hypothesize that the same mechanism 
described here for the proofreading step governs the initial tRNA 
selection step. 


METHODS SUMMARY 


Ribosomes were purified from Thermus thermophilus cells as described before”’. 
For all complexes, mRNA, tRNA™* and tRNA,/" (or tRNA”) were present in 
fivefold stoichiometric excess of the ribosome concentration. Complexes with 
paromomycin were obtained by including the antibiotic (30 1M) into the incuba- 
tion mixture. Crystals were grown at 24°C by sitting-drop vapour diffusion as 
described before”’. All crystals belonged to space group P2,2;2, and contained two 
molecules per asymmetric unit. A very low dose mode with high redundancy was 
used for data collection?’. The structure”!, with tRNAs, mRNA and metal ions 
removed, was used for refinement with Phenix”. Throughout refinement, no base- 
pair restraints were used between tRNAs and mRNAs to avoid bias towards 
standard base-pair geometry. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Complex formation and crystallization. Ribosomes were purified from Thermus 
thermophilus cells as described before*'. The 30-nucleotide-long mRNA con- 
structs I-IV (see below) were purchased from Dharmacon. In all sequences the 
AUG start codon is underlined and the A codons are in bold. Purified native 
uncharged Escherichia coli tRNA™*, tRNA!" and tRNA” were supplied by 
Chemical Block. Aminoglycoside antibiotic paromomycin was purchased from 
Sigma-Aldrich. 

The cognate and near-cognate complexes were formed in 10 mM tris-acetate, 
pH 7.0, 40 mM KCl, 7.5 mM Mg(Ac), 0.5 mM DTT by incubating 70S ribosomes 
(3 uM) with mRNA-I, -II, -III or -IV and tRNA™* for 10 min at 37°C. Then 
tRNA,“ and tRNA?” were added to the mixtures with mRNA-I or -II and 
mRNA-III or -IV, respectively, and the complexes were further incubated for 
30 min. For all complexes mRNA, tRNA™* and tRNA»! (or tRNA!) were 
present in fivefold stoichiometric excess of the ribosome concentration. 
Complexes with paromomycin were obtained by including the antibiotic 
(30 4M) into the incubation mixture containing mRNA-II with tRNA," and 
mRNA-IV with tRNA™. Crystals were grown at 24°C by sitting-drop vapour 
diffusion as described before”. In agreement with previous results, initiator 
tRNA™* was found in the P site of all complexes and either tRNA," or 
tRNA’ was found in the A site (tRNA™* was easily distinguishable from 
tRNA," and tRNA?” based on the large variable loops in those tRNAs). In 
the complexes with mRNA III and IV, the E site was occupied by tRNA™. 
However, in complexes with mRNA I and II, the quality of the density did not 
allow identification of the E site tRNA which was then modelled as tRNA™*. 
mRNA-I 5’-GGCAAGGAGGU(U),AUGCUC(U)>-3’ (cognate for tRNA,'*). 
mRNA-II5’-GGCAAGGAGGU(U),AUGUUU(U)>-3’ (near-cognate for tRNA,“"). 
mRNA-III 5’-GGCAAGGAGGU(A),AUGUAC(A)-3’ (cognate for tRNA™). 
mRNA-IV 5'-GGCAAGGAGGU(A),AUGUGC(A)o-3’ (near-cognate for RNA™). 
Data collection, processing and structure determination. All crystals belong to 
space group P2,2;2; and contain two molecules per asymmetric unit. Data on all 
six complexes were collected at 100K at the Synchrotron Light Source, 
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Switzerland, using the Pilatus 6M detector. A very low dose mode was used and 
huge redundancy was collected”. The structure’’, with tRNAs, mRNA and metal 
ions removed, was used for refinement with Phenix*’. The initial model was 
correctly placed within each data set by rigid body refinement with each molecule 
as a rigid body. This was followed by rigid body refinement of individual subunits. 
After positional and B-factor refinement, the resulting electron density maps were 
inspected and the tRNAs and mRNA ligands were built in these unbiased maps. In 
all of these and the following refinement rounds, no base-pair restraints were used 
between tRNAs and mRNAs to avoid bias towards perfect base-pair geometry. 
After several cycles of manual rebuilding followed by positional and individual 
isotropic B-factor refinement, magnesium ions were added and a final refinement 
round took place. A summary of the crystallographic data and refinement statistics 
is given in Supplementary Tables 1 and 2. Supplementary Table 3 shows the 
average B-factor for the entire structure as well as average B-factors for the decod- 
ing centre and the codon-anticodon helix. From this table it is seen that the 
average B-factors for the substructure comprising the decoding centre (G530, 
A1492, A1493 from 16S and G1913 from 23S), as well as the codon-anticodon 
helices (nucleotides 34-36 of tRNA in the A site and the corresponding codon of 
mRNA), are less than the overall B-factor for the entire ribosome structure. 
Therefore it is clear that the decoding centre is part of the most accurately deter- 
mined parts of these models. 

To verify that the base-pair geometry described in the paper is correct, we 
performed a many extra independent refinement rounds with base-pair geometries 
restrained to various standards (Watson—Crick, wobble, etc.) so that we could be 
confident about the reported geometries. Supplementary Figs 3 and 4 show OMIT- 
averaged kick maps***! of the GeU mismatch for the two near-cognate complexes. 
These unbiased maps show that a wobble conformation of these base pairs would 
not fit into the electron density and clearly demonstrate that a Watson—Crick 
conformation is the only plausible fit. 


31. Praaenikar, J., Afonine, P. V., Guncar, G., Adams, P. D. & Turk, D. Averaged kick 
maps: less noise, more signal and probably less bias. Acta Crystallogr. D 65, 
921-931 (2009). 
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MAP and kinesin-dependent nuclear positioning is 
required for skeletal muscle function 


Thomas Metzger!**, Vincent Gache**, Mu Xu!, Bruno Cadot?, Eric S. Folker', Brian E. Richardson’, Edgar R. Gomes?** 


& Mary K. Baylies!?* 


The basic unit of skeletal muscle in all metazoans is the multi- 
nucleate myofibre, within which individual nuclei are regularly 
positioned’. The molecular machinery responsible for myonuclear 
positioning is not known. Improperly positioned nuclei are a hall- 
mark of numerous diseases of muscle’, including centronuclear 
myopathies*, but it is unclear whether correct nuclear positioning 
is necessary for muscle function. Here we identify the microtubule- 
associated protein ensconsin (Ens)/microtubule-associated protein 7 
(MAP7) and kinesin heavy chain (Khc)/Kif5b as essential, evolutio- 
narily conserved regulators of myonuclear positioning in Drosophila 
and cultured mammalian myotubes. We find that these proteins 
interact physically and that expression of the Kif5b motor domain 
fused to the MAP7 microtubule-binding domain rescues nuclear 
positioning defects in MAP7-depleted cells. This suggests that 
MAP7 links Kif5b to the microtubule cytoskeleton to promote 
nuclear positioning. Finally, we show that myonuclear positioning 
is physiologically important. Drosophila ens mutant larvae have 
decreased locomotion and incorrect myonuclear positioning, and 
these phenotypes are rescued by muscle-specific expression of 
Ens. We conclude that improper nuclear positioning contributes 
to muscle dysfunction in a cell-autonomous fashion. 

To identify the mechanisms governing nuclear positioning in muscle, 
we performed an F; recessive ethylmethane sulphonate mutagenesis 
screen using the apterousME-NLS::dsRed transgenic line (apRed)’, 
which expresses dsRed in the nuclei of the four lateral transverse muscles 
of the Drosophila embryo. At the end of embryogenesis, the myonuclei 
are distributed throughout the muscles (Fig. la, b). One mutation 
resulted in clustered nuclei near the ventral end of each lateral transverse 
muscle. Within each hemisegment these clusters resembled the Nike 
trademark, so we named the mutant swoosh (swo) (Fig. 1a). 

Further analysis of swo mutant embryos indicated that myoblast 
specification’, fusion®, muscle elongation®’ and attachment’ occur 
normally. The only notable difference in swo mutant muscles is a 
ventral bulge in the cell body that correlates with the cluster of nuclei 
(Supplementary Figs 1 and 2). These data suggest that the clustering of 
nuclei in swo mutants is a genuine nuclear positioning defect. 

To examine nuclear movement, we performed in vivo time-lapse 
imaging on apRed control and swo,apRed mutant embryos (Fig. 1b). In 
stage 14 apRed control embryos (10.5 h after egglaying (AEL)), nuclei 
in the four lateral transverse muscles group near the ventral end of each 
growing myotube in the hemisegment. During stage 15 (11.5 h AEL) 
after the completion of myoblast fusion and muscle attachment, the 
nuclei within each myotube begin to separate into distinct dorsal and 
ventral clusters. These clusters finish their migration during stage 16 
(15h AEL) and each cluster resides at an opposite end of the myotube. 
Finally, during stage 17 (18h AEL), the six to eight nuclei in each 
lateral transverse muscle move from the clusters and distribute evenly 
throughout each lateral transverse muscle. The entire process of 
nuclear movement spans 7h (Supplementary Movie 1). Time-lapse 


imaging of swo,apRed mutant embryos demonstrated that the nuclei 
fail to undergo the initial separation into dorsal and ventral clusters 
during stage 15 and remain clustered at stage 17, indicating that swo 
does not simply cause a delay in myonuclear positioning (Supplemen- 
tary Movie 2). In addition, the nuclei in every other somatic muscle 
examined in swo mutant embryos also failed to separate (Supplemen- 
tary Fig. le). 

We determined that the swo flies carry a nonsense mutation in 
ensconsin (ens), a gene encoding a MAP. Expression of haemagglutinin- 
tagged Ens specifically in the developing mesoderm and muscle rescued 
the nuclear positioning phenotype in swo embryos, confirming that the 
mutation in ens is responsible for the nuclear positioning defect. 
Depletion of ens solely in the developing mesoderm and muscle by 
RNA-mediated interference recapitulated the phenotype we observed 
in the swo mutant (renamed ens*”°), and known ens mutant alleles 
failed to complement ens*”’ (Supplementary Figs 1c and 2a, and data 
not shown). These results indicate that ens is required autonomously 
within muscle for proper myonuclear positioning. We also found that 
nuclear position was disrupted in ens*"”’ oocytes* (Supplementary Fig. 
3a, b), but not in ens*”” photoreceptor cells (Supplementary Fig. 3c, and 
data not shown)’, indicating that Ens is necessary for some, but not all, 
nuclear positioning processes in Drosophila. 

Four different genes in the mouse genome encode Ens orthologues: 
Map7 (E-MAP-115, Ensconsin), Map7D1, Map7D2 and Map7D3 
(Supplementary Fig. 2c). Using cultures of C2C12 myoblasts and 
primary mouse myoblasts, we examined the effect on nuclear position- 
ing in myotubes depleted of each Ens orthologue by short interfering 
RNA (siRNA) (Fig. 1 and Supplementary Fig. 4). MAP7 depletion 
caused a significant increase in the aggregation of nuclei within a 
myotube but did not affect myoblast fusion or myotube differentia- 
tion (Fig. 1c,d, Supplementary Fig. 4d,e and Supplementary Movies 
3-6). Depletion of MAP7D1, MAP7D2 or MAP7D3 did not affect 
nuclear positioning (Supplementary Fig. 4b). Expression of full-length 
MAP7 in MAP7-depleted myotubes restored nuclear alignment (Fig. 1d 
and Supplementary Fig. 6h). Thus, MAP7 is required for nuclear 
positioning in both Drosophila muscles and cultured mammalian 
myotubes. 

No major defects on the microtubule network were observed in 
ens’ mutant embryos or MAP7-depleted myotubes (Supplemen- 
tary Figs 2g—h and 6g). Therefore, to gain mechanistic insight, we 
conducted a yeast two-hybrid screen to find Ens-binding proteins 
and identified kinesin heavy chain (Khc) (Supplementary Fig. 5). 
Both a kinesin-null mutation (khc*) and a motor-dead mutation 
(khc*)° disrupted myonuclear positioning without affecting muscle 
elongation and attachment (Fig. 2a and Supplementary Fig. 5a). 
Likewise, siRNA-mediated depletion of the mammalian Khc orthologue 
KifSb (ref.11) in primary and C2C12 myotubes disrupted nuclear 
alignment similarly to MAP7 depletion without affecting myotube 
formation or differentiation (Fig. 2b, c, Supplementary Figs 4d-e 
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Figure 1 | Myonuclear positioning requires ensconsin/MAP7. 
a, Hemisegments from control and ens*”° embryos at the beginning (stage 14) 
and end (stage 17) of nuclear migration. Muscles are immunostained for 
tropomyosin (green); nuclei are immunostained for dsRed (red). Scale bar, 
10 pm. b, Time-lapse images of nuclear migration in hemisegments of control 
and ens*"”’ embryos. Elapsed imaging times are shown along the top. Scale bar, 


10 pm. c, Time-lapse images of nuclear migration in control and Map7- 


and 6, and Supplementary Movies 7 and 8). Expression of full-length 
KifSb in Kif5b-depleted myotubes restored nuclear alignment (Fig. 2c 
and Supplementary Fig. 6h). These data indicate that Khc, with a 
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Figure 2 | Kinesin is required for myonuclear positioning. a, Single 
hemisegments from stage 17 (22 h AEL) embryos of indicated genotypes. Scale 
bar, 10 ttm. b, Representative immunofluorescence images of control and 
Kif5b-depleted C2C12 myotubes differentiated for 4 days and immunostained 
for myosin heavy chain (green) and 4’,6-diamidino-2-phenylindole (red). Scale 
bar, 15 pum. c, Histogram of nuclear distribution in C2C12 myotubes that were 
untreated (control) or treated with the indicated siRNA or KifS5b-depleted cells 
expressing full-length Kif5b (no. 781 + FL-Kif5b). Error bars indicate s.e.m. 
Asterisk, P < 0.05; two asterisks, P < 0.01; three asterisks, P< 0.001 (scrambled 
versus experimental condition). 


2 | NATURE | VOL 000 | 00 MONTH 2012 


ae 14 Stage 15 Brel 16 cee 17 a AZ Bea 17 


5h 7.5h 10h 


Map7 siRNA 


1.5h 


m Aligned 
m Aggregated 
m Other 


[o>] 
oO 


aS 
oO 


20 


Percentage of total myotubes & 


Map7 siRNA 


depleted C2C12-H1B-GFP myotubes. Elapsed imaging times are shown along 
the top. Nuclei are stained red. Scale bar, 15 tum. d, Nuclear distribution in 
C2C12 myotubes that were untreated (control) or treated with the indicated 
siRNA and Map7-depleted cells expressing full-length MAP7 (no. 118 + FL- 
MAP?7). Error bars indicate s.e.m. Asterisk, P < 0.05; two asterisks, P< 0.01; 
three asterisks, P< 0.001 (scrambled versus experimental condition). 


functional motor domain, is required for proper nuclear positioning 
in both Drosophila muscles and cultured mammalian myotubes. 

We next examined whether Ens and Khc interact functionally to 
position myonuclei. Embryos heterozygous for ens or khc position 
their myonuclei normally. However, embryos that were doubly 
heterozygous for ens*”° and khc showed defective nuclear positioning 
similar to that in the homozygous ens embryos (Supplemen- 
tary Fig. 5). This demonstrates that ens and khc interact functionally 
and that the dosage of these genes is important for proper nuclear 
positioning. 

We next tested whether Ens/MAP7 and Khc/Kif5b interact physically. 
Endogenous Kif5b co-immunoprecipitated with expressed full-length 
green fluorescent protein (GFP)-tagged MAP7, independently of micro- 
tubules (Fig. 3a, b). Using fragments of MAP7, we mapped the Kif5b 
interaction to the carboxy-terminal coiled-coil domain of MAP7 (CC2), 
which was confirmed by reciprocal co-immunoprecipitation (Fig. 3a—-c). 
Similarly, fragments of Kif5b were used to identify the C-terminal region 
(KifS5b-motorless) as the MAP7-interacting domain (Fig. 3a, d, e). The 
C-terminal coiled-coil domain of MAP7 therefore interacts with the 
C-terminal region of Kif5b (Fig. 3). 

To test whether the specific interaction identified above between 
Ens/MAP7 and Khc/Kif5b is required for nuclear positioning, we 
expressed chimaeras consisting of the Kif5b motor domain’’ fused to 
fragments of MAP7 that contained (K-EMTB and K-EMTB-M) or 
lacked (K-N-term) the microtubule-binding domain (EMTB)”* in 
Map7 siRNA-depleted myotubes (Fig. 3a, f, g). Expression of chimaeras 
that contained the MAP7 EMTB rescued the alignment of nuclei in 
MAP7-depleted myotubes, whereas expression of the EMTB domain 
alone, or a chimaera that lacked the MAP7 EMTB (K-N-term) did not 
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Figure 3 | Kinesin and ensconsin/MAP7 interact to regulate nuclear 
position. a, Kif5b, MAP7 and chimaera constructs amino-terminally tagged 
with GFP (MAP7 and chimaeras) or c-Myc (Kif5b). Numbers indicate amino 
acids. b, Western blot with indicated antibodies (right) of GFP-MAP7 (top) 
immunoprecipitations (IP). c, Anti-GFP western blot of Kif5b 
immunoprecipitations from C2C12 cells expressing indicated constructs (top). 
Asterisk indicates anti-Kif5b IgG. d, Anti-c-Myc western blot from C2C12 cells 


(Fig. 3f, g). These results indicate that the function of MAP7 in nuclear 
positioning is to link Kif5b to microtubules. 

To investigate the physiological impact of mispositioned nuclei on 
muscle function in vivo, we examined larval motility’*, ens mutant 
larvae move significantly more slowly than controls (Fig. 4b and 
Supplementary Fig. 7d). Moreover, analysis of the muscle structure 
in tracked larvae revealed that the myonuclei in ens”? mutants were 
33% closer together than in controls (Fig. 4a, c and Supplementary Fig. 
7a, e). Sarcomere structure’’, t-tubules’®, neuromuscular junctions”, 
nuclei number (Supplementary Fig. 7c, e, f, h, i) and mitochondrial 
localization (Supplementary Fig. 8) were unchanged in ens mutants 
compared with controls. Expression of Ens in the mesoderm and 
muscle during embryonic development or in muscle starting at the 
L2 larval stage in ens mutants rescued the nuclear positioning and 
motility defects in L3 larvae (Fig. 4a—-c and Supplementary Fig. 7a, d, 
g), demonstrating a strong correlation between aberrant myonuclear 
position and decreased muscle function. 

These experiments define a subset of the nuclear behaviours that 
occur during muscle differentiation, and demonstrate conservation 


ad 
.. aS 


Map7 siRNA 


expressing Kif5b constructs. e, Anti-c-Myc western blot from GFP 
immunoprecipitations with C2C12 myotubes expressing the indicated 
constructs. f, Nuclear distribution in Map7-depleted C2C12 myotubes 
expressing the indicated chimaeras. Error bars indicate s.e.m. Asterisk, 
P<0.05; two asterisks, P< 0.01. g, Map7-depleted C2C12s expressing 
indicated chimaeras. Anti-GFP is shown in green; nuclei are shown in red 
(stained with 4’,6-diamidino-2-phenylindole). Scale bar, 15 um. 


across species at both the cellular and molecular levels. We find that 
Ens/MAP7 and Khce/Kif5b are critical for myonuclear positioning and 
are not required for fusion and myofibre formation. Moreover, these 
proteins interact genetically and physically, and the physical inter- 
action is necessary for proper nuclear positioning. Furthermore, our 
results argue that the correct spacing of myonuclei is required for 
proper muscle function. 

Within a myotube, distinct microtubule networks emanate from 
each myonucleus, producing regions of overlapping antiparallel 
microtubules’**°. Furthermore, Ens/MAP7 functions to load Khc/ 
Kif5b onto microtubules®. The results presented here therefore suggest 
a molecular mechanism for nuclear positioning. Adjacent nuclei could 
be positioned relative to each other by Ens/MAP7 and Khc/Kif5b 
interacting with and sliding antiparallel microtubules that have their 
minus ends anchored to the nuclear envelope, in a similar manner to 
the mechanism by which the Eg5 kinesin facilitates spindle elonga- 
tion”! (Supplementary Fig. 10). This mechanism is distinct from the 
kinesin-1-dependent nuclear position in the Caenorhabditis elegans 
hypodermis, in which Khc is anchored to the nucleus by kinesin light 
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Figure 4 | Ensconsin/MAP?7 is required for intracellular muscle 
organization and efficient larval locomotion. a, Maximum-intensity xy 
projections of muscle VL3 from segment A3 of L3 larvae from the indicated 
genotypes stained for actin (red), nuclei (white) and Z-bands (green). Scale bar, 
20 um. b, Average velocity of migration for L3 larvae of the indicated genotypes. 


chain (Klc) and the KASH-domain protein Unc-83, which drive the 
nucleus unidirectionally on microtubules”. We find that klc mutants 
in Drosophila do not show defects in the positioning of nuclei that we 
describe here. Additionally, overexpression of dominant-negative 
(dn)-KASH-domain constructs in C2C12 myotubes that remove 
KASH proteins from the nuclear envelope and disrupt the function 
of the LINC (linker of nucleoskeleton and cytoskeleton) complex and 
nuclear anchoring” does not affect nuclear positioning (Supplemen- 
tary Fig. 9). Taken together, these results suggest that kinesin is not 
interacting directly with the nucleus; they therefore provide evidence 
for the sliding mechanism. 

Patients suffering from various muscle diseases, including centronuc- 
lear myopathies, show muscle weakness with mislocalized myonuclei’. 
However, it remains unclear whether these mispositioned nuclei con- 
tribute to muscle weakness or are simply a result of impaired myofibre 
function®. Our results show that the ability to position myonuclei 
correctly correlates with better muscle function. Because nuclear 
misposition is the first observable defect in zygotic ens mutant embryos, 
we speculate that this is a cause of the observed muscle weakness. 
Mispositioned nuclei might lead to the observed muscle weakness as a 
result of a disruption in the size and spacing of myonuclear domains 
throughout the muscle” or in the distribution of subcellular structures 
whose positioning depends on correct nuclear spacing. We rescue 
nuclear position, muscle function, and viability not only by supplying 
Ens during embryogenesis but also by expressing Ens solely in mature 
muscles later in larval development. We therefore propose that correct- 
ing nuclear positioning defects in patients with muscle diseases might 
benefit muscle strength and improve muscle function. The model 
systems that we present here provide a platform with which to identify 
additional nuclear positioning genes and assess their function, and to 
screen for potential drugs to alleviate muscle weakness due to disease. 


METHODS SUMMARY 


Drosophila stocks used included apME-NLS::dsRed (ref. 4), Df(3L)GN34 (ref. 26), 
ens'#P36480 (Bloomington), ens?! (Harvard), Df-ens*°?7 7, ensAN, ens4© (ref. 8), 
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Each of the indicated Gal4 drivers is expressing UAS-ensHA in a homozygous 
ens*”° mutant background. Error bars, s.e.m. Two asterisks, P< 0.01; three 

asterisks, P< 0.001. c, Nearest-neighbour analysis of nuclei within muscle VL3 
from segment A3 from L3 larvae of the indicated genotypes. Error bars indicate 


s.e.m. Asterisk, P< 0.05; three asterisks, P< 0.001. 


khc’, khc*, khc*? (ref. 10), kic***** (ref. 27), twi-Gal4 (ref. 4), alpha-Gal4, G7-Gal4 
(ref. 28) (from K. Broadie), and UAS-ens-IR lines 106207 and 18491 (VDRC). 
UAS-ensHA transgenic flies were generated by BestGene Inc. Mouse cells were 
transfected with siRNA using Lipofectamine RNAiMAX (Invitrogen) or with 
DNA using Lipofectamine 2000 (Invitrogen). Primary antibodies used were rabbit 
anti-dsRed (Clontech), rat anti-tropomyosin (Abcam), mouse anti-myosin heavy 
chain (from S. Abmayr), rabbit anti-Zasp (from F. Schéck), chicken anti-B-gal 
(Abcam), mouse anti-a-tubulin (Sigma), rat anti-ensconsin (from P. Rorth), 
guinea pig anti-Kriippel (from J. Reinitz), rabbit anti-Eve (from M. Frasch), mouse 
anti-B-PS integrin (DSHB), rabbit anti-vestigial (from S. Carroll), fluorescein 
isothiocyanate-conjugated anti-horseradish peroxidase (Jackson ImmunoResearch), 
mouse anti-discs large (DSHB), rabbit anti-ATP synthase” (from H. Duan), rat 
anti-DE-cadherin (DSHB), mouse anti-chaoptin 24B10 (DSHB), rat anti-elav 
(DSHB), Alexa Fluor 488-conjugated wheatgerm agglutinin (Invitrogen), MF20 
(DSHB), rabbit anti-KHC (Santa Cruz), mouse anti-c-Myc (Roche) and rabbit 
anti-GFP (Invitrogen). Secondary antibodies were either biotinylated (Vector 
Laboratories and Jackson ImmunoResearch) or conjugated to Alexa Fluor 488, 
555 or 647. The fusion index”, sarcomere length’, bouton number” and larval 
velocity’* were quantified as described, with minor modifications (see Methods). 
The yeast two-hybrid screen was performed with full-length Ens by Hybrigenics 
SA Services using a 0-24-h Drosophila complementary DNA library. Standard 
protocols were used for immunoprecipitation, western blotting and quantitative 
polymerase chain reaction experiments and are described in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Flies and husbandry. Stocks were grown and maintained under standard condi- 
tions on standard cornmeal medium. Crosses were performed at 25 °C. Drosophila 
stocks: apME-NLS::dsRed (ref. 4), Df(3L)GN34 (ref. 26), ensitP36480 (Bloomington), 
ens®7!?! (Harvard), ens“, ens4°, Df-ens*?7” (ref. 8), UAS-ensHA (this study), 
khe®, khc*, khc?? (ref. 10), klc®***4 (ref. 27), and UAS-ens-IR lines 106207 and 18491 
(VDRC). Mutant alleles were balanced and identified using CyO P/w+ en1 lacZ] 
or CTG [CyO,twi-GAL4,UAS-2xeGFP] (ref.26), TM3 Sb1Dfd-lacZ or TTG 
[TM3,twi-GAL4, UAS-2xeGFP] (ref.31). Gal4 lines used in this work were twi- 
Gal4 (ref.4), which is specifically expressed in the mesoderm and developing 
muscle (2.5-13h AEL), and alpha-Gal4 and G7-Gal4 (ref.28) (gift from 
K. Broadie), which are specifically expressed in the larval muscles beginning at 
the late L1 and early L2 larval stages and perdures until pupation. 
Ethylmethane sulphonate mutagenesis and screening. A detailed description of 
the screen and its findings are available from the authors. In brief, males from an 
isogenized stock carrying the apME-NLS::dsRed transgene on the third chro- 
mosome were mutagenized with 35 mM ethylmethane sulphonate dissolved in 
1% sucrose in accordance with a standard protocol”. Individual mutant lines were 
established over the TTG balancer. Live F; progeny were screened under fluor- 
escence, and embryos homozygous for the mutant chromosome were identified by 
an absence of the balancer GFP. Deficiency mapping, complementation and 
sequencing were used to identify the genetic lesion. 
Phylogenetic analysis. Drosophila, human and mouse MAP7 orthologues were 
aligned using LALIGN (http://www.ch.embnet.org/software/LALIGN_form.html) 
and the phylogenetic tree was created using Vector NTI (Invitrogen). 
Cell culture. C2C12 myoblasts were grown and differentiated for 4days as 
described**. Primary myoblasts were isolated and differentiated for 3 days as 
described*°™, 
Constructs. Map7 fragments were cloned from a C2C12 cDNA library into 
pEGFP-C1 vector (Invitrogen), using the following primers: N-term (1-244), 
5'-ATGGCGGAGCAGGGAGC-3’ (forward) and 5’-CTAGCTGTTTTTCTCG 
TTCCTC-3’ (reverse); EMTB-M (261-1351),5'-ATGTGGCTAGAGAGAGAA 
GAACGAG-3' (forward) and 5’-TAGGAACTACACCGACAGTCACAG-3’ 
(reverse); M-CC2-Cterm (859-2187), 5’-ATGACCATTCATGGACTAGCG 
AG-3' (forward) and 5’-AACTTCTGCGGTCTGTTGT-3’ (reverse); M-CC2 
(859-1806), 5'-ATGACCATTCATGGACTAGCGAG-3’ (forward) and 5’-TT 
TCTTATCAGCGGTCTCTGTCCTC-3’ (reverse); CC2-Cterm (1337-2187), 
5'-ATGGGCGGTGTAGTTCCTAAGACTTCTG-3’ (forward) and 5'-AACTT 
CTGCGGTCTGTTGTG-3’ (reverse); CC2 (1337-1806), 5'-ATGGGCGGTGT 
AGTTCCTAAGACTTCTG-3’ (forward) and 5'-TTTCTTATCAGCGGTCTCT 
GTCCTC-3' (reverse); M (859-1351), 5’-ATGACCATTCATGGACTAGC 
GAG-3’ (forward) and 5’-TAGGAACTACACCGACAGTCACAG-3’ (reverse). 

Full-length Map7 (NM_008635) was made from the Map7 fragments and was 
cloned into pEGFP-C1. EMTB-Map7 was made by cloning the fragment consist- 
ing of base pairs 1-947 from full-length Map7 into the pEGFP-Cl1 vector. K-N- 
term, K-EMTB and K-EMTB-M chimaera constructs were made by cloning the 
K560 fragment from pET17b-K560-GFP (gift from R. Vale) into N-Term, EMTB 
and EMTB-M Map7 pEGFP constructs, respectively. Motor-Kif5b, S-Motorless- 
Kif5b and Motorless-Kif5b c-Myc were a gift from G. Kreitzer. FL-Kif5b was made 
by cloning the K560 fragment with a C-terminal region of full-length Kif5b (gift 
from C.C. Hoogenraad) into pEGFP C1 construct. EGFP-KASH2 and EGFP- 
KASH2ext were a gift from D. Hodzic”. 

siRNA sequences (Ambion): Map7, 5'-CAGAUUAGAUGUCACCAAUTT-3’ 
(no. 118), 5’-CCAUGAAUCUUUCGAAACATT-3’ (no. 119) and 5'-ACUUAC 
CUGUUGGAUCAAATT-3’ (no. 120); Map7D1, 5'-GGAACAGAGGGAACG 
CGAATT-3’ (no. 710), 5’-ACGUGGACUCUAUAAUCAATT-3’ (no. 711) and 
5'-GCAAUCCAGCGGUCAGUGATT-3’ (no.712); Map7D2, 5'-CUUUCAAC 
AUCAACCAUGATT-3' (no. 436), 5’-CUUGAUGACUGUAACAAAATT-3' 
(no. 437) and 5’-GGUCUCCUGUGAAGUAUUATT-3’ (no.438); Map7D3, 
5'-GGACGACAUCUCUAAAGUUTT-3’ (no. 000), 5’-GAACUAUCCUCCAU 
AGUGATT-3’ (no. 001) and 5'-CUGAUGAGGUUGAUACCAATT-3’ (no. 002); 
Kif5b, 5'-GCAAGAAGUAGACCGGAUATT-3’ (no. 781), 5’-GCUGUUAAU 
UAUGAUCAGATT-3’ (no. 782) and 5’-GACAUGUCGCAGUUACAAATT-3’ 
(no. 783). 
Transfections. Myoblasts were transfected with siRNA using Lipofectamine 
RNAiMAX (Invitrogen) or DNA using Lipofectamine 2000 (Invitrogen). H1B- 
GFP stable cell line was made by transfecting cells with H1B-GFP (gift from R. 
Vallee). 
qPCR. Messenger RNA from C2C12 cells was isolated with the RNeasy Micro Kit 
(Qiagen), and cDNA was prepared with the Transcriptor first-strand cDNA 
synthesis kit (Roche). Quantitative analyses (qPCR) were performed using 
CyberGreen kit in a LightCycler 480 II system (Roche), using the following 
primers: KifSb-F, 5'-GGAGGCAAGCAGTCGTAAAC-3’; Kif5b-R, 5'-TCTAG 


TGTTGGGAAGCAGCA-3’; Map7-F, 5'-TGAAACCAATTTTGGCCTTT-3’; 
Map7-R, 5'-AAAACCAGAGGGAGGAGCTG-3’; Hprt1-F, 5’-GTTAAGCAG 
TACAGCCCCAAA-3'; Hprt1-R, 5'-AGGGCATATCCAACAACAAACTT-3’. 
Immunoprecipitations. C2C12 cells were transfected with GFP-tagged or c-Myc- 
tagged constructs. After 24h, cell lysates were prepared using lysis buffer (10 mM 
Tris-HCl pH 7.4, 150mM NaCl, 0.5 mM EDTA, 0.5% Nonidet P40, 1 mM phe- 
nylmethylsulphonyl fluoride) containing protease inhibitor cocktail (Roche) and 
0.6 4M nocodazole (Sigma). Immunoprecipitation of GFP-fusion proteins or 
endogenous Kif5b was performed at 4°C using the magnetic GFP-Trap kit 
(Chromotek) or Protein A magnetic beads (Millipore) coated with Khc antibody, 
respectively. 

Western blot. Extracts were loaded on a 4-12% gradient gel (Invitrogen) and 
transferred using iBlot (Invitrogen). 

Immunohistochemistry, immunofluorescence, microscopy, and image ana- 
lysis. Embryos were collected at 25°C on apple-juice agar plates and were fixed 
as described* except for tubulin staining, for which the embryos were fixed in equal 
volumes of 10% formalin (Sigma) and heptane (Fisher Scientific). Larvae were 
dissected and fixed in 4% paraformaldehyde (Electron Microscopy Sciences) in 
relaxing buffer as described previously'****’. Eye discs were dissected from 
wandering third-instar larvae and fixed in 4% paraformaldehyde. Ovaries from 
females that were fed on fresh yeast for 2 days were dissected in PBS and fixed in 
4% paraformaldehyde. All fluorescent stainings were mounted in Prolong Gold 
(Molecular Probes); otherwise, embryos were mounted in Araldite. 

Mouse cells were fixed using 3.7% paraformaldehyde (Sigma) and permeabilized 
with 0.5% Triton. All fluorescent stainings were mounted in Fluoromount G 
(Southern Biotech). 

Antibodies were preabsorbed (PA) where noted, and used at the specified final 
dilutions: rabbit anti-dsRed (1:400; Clontech), rat anti-tropomyosin (1:1,000; 
Abcam), mouse anti-myosin heavy chain (1:400; gift from S$. Abmayr), rabbit 
anti-Zasp (1:400; gift from F. Schock), chicken anti-B-gal (1:1,000; Abcam), mouse 
anti-o.-tubulin (1:500; Sigma), rat anti-ensconsin (1:100; gift from P. Rorth), 
guinea pig anti-Kriippel (PA; 1:2,000; gift from J. Reinitz), rabbit anti-Eve (PA; 
1:3,000; gift from M. Frasch), mouse anti-B-PS integrin (1:50; DSHB), rabbit anti- 
vestigial (PA; 1:50; gift from S. Carroll), fluorescein isothiocyanate-conjugated 
anti-horseradish peroxidase (1:500; Jackson ImmunoResearch), mouse anti-discs 
large (1:200; DSHB), rabbit anti-ATP synthase”? (1:100; gift from H. Duan)), rat 
anti-DE-cadherin (1:100; DSHB), mouse anti-chaoptin 24B10 (1:200; DSHB), rat 
anti-elav (1:50; DSHB), mouse anti-myosin heavy chain MF20 (1:500; DSHB), 
rabbit anti-KHC (1:2,500; Santa Cruz), mouse anti-c-Myc (1:5,000; Roche) and 
rabbit anti-GFP (1:2,500; Invitrogen). Biotinylated secondary antibodies (Vector 
Laboratories and Jackson ImmunoResearch) and the Vectastain ABC kit (Vector 
Laboratories) were applied for non-fluorescent stainings. Alexa Fluor 488- 
conjugated, Alexa Fluor 555-conjugated and Alexa Fluor 647-conjugated secondary 
antibodies, Alexa Fluor 546-conjugated and Alexa Fluor 647-conjugated phalloidin, 
Alexa Fluor 488-conjugated wheatgerm agglutinin (Invitrogen), and Hoechst 
33342 or 4',6-diamidino-2-phenylindole were used for fluorescent staining 
(Invitrogen). 

Fluorescent images of Drosophila were acquired on a Leica SP5 laser scanning 
confocal microscope running the LAS AF 2.2 software (objectives used: 20X 0.70 
numerical aperture (NA) HC PL APO multi-immersion, 40 1.25 NA, 63X 1.4 
NA or 100X 1.43 NA HCX PL APO oil-immersion). Non-fluorescent images were 
acquired on a Zeiss Axiophot microscope. Image stacks were analysed and pro- 
cessed using Volocity (Improvision), ImageJ, and Photoshop CS4 (Adobe). 

Fluorescent images of mouse cells were acquired with a Nikon Ti microscope 

equipped with a CoolSNAP HQ2 camera (Roper Scientific), an xy motorized stage 
(Nikon) and a 40X 1.0NA PL APO oil-immersion objective, using Metamorph 
Software (Molecular Devices). 
Quantification methods. Nuclear positioning was quantified in mammalian 
myotubes containing at least three nuclei, and myotubes were classified as follows: 
‘aligned’, more than 70% of the nuclei in a myotube aligned along the same axis; 
‘aggregated’, more than 70% of the nuclei did not align along the same axis; ‘other’, 
nuclei in a myotube did not fall into either category. Fusion index was quantified as 
described”. 

A nearest-neighbour analysis in Drosophila was conducted to determine the 
average distance between a nucleus and its single closest neighbour within a 
muscle. To identify the closest neighbour for each nucleus in a muscle, the distance 
between the centre of a nucleus and the centre of each of the closely surrounding 
nuclei (usually four to eight) was measured. The shortest distance identified the 
closest nucleus or ‘nearest neighbour’ and this distance was recorded. A nearest 
neighbour was identified for each nucleus in every muscle analysed and the nearest 
neighbour distances were averaged to give an average ‘nearest-neighbour’ distance 
for the average space between one nucleus and the next closest nucleus in a muscle. 
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Sarcomere length was defined as the distance between the Z-lines labelled with 
the anti-ZASP antibody'’. Sarcomere length was measured at five separate regions 
per muscle and in the same five regions for each muscle examined. 

Neuromuscular junctions were revealed by staining with horseradish peroxidase 
or Dlg, and quantified as previously described". 

The position of the oocyte nucleus was determined by measuring the distance 

from the anterior border of the oocyte to the centre of the nucleus. 
Time-lapse imaging. Drosophila embryos were prepared as described*”*. Time- 
lapse sequences were acquired using a Zeiss Axio Imager.Z1 with a 20X 0.75 NA 
Plan-Apochromat dry objective. All time-lapse series were taken as a set of z-stacks 
over time (four-dimensional imaging) with optical sections every 3 tm. Only a 
single z-section with the greatest number of nuclei in focus was selected for each 
time point. Images were processed with ImageJ and compiled into movie by using 
Apple Quicktime. Single images of live embryos were acquired on a Zeiss Axiophot 
microscope. 

Time-lapse imaging of mouse cells was done as reported”. 

Yeast two-hybrid screen. Yeast two-hybrid screening was based on a full-length 
Ens as bait and performed by Hybrigenics S.A Services using a 0-24-h embryonic 
Drosophila cDNA library; 99 million interactions were analysed, and 10% of the 
high-confidence clones that were recovered from the screen were Khe. 

Larval behaviour. Larval behaviour was assessed as described previously, with 
minor modifications'*”’. In brief, embryos (stages 16 and 17) were selected for the 
presence of clustered apRed nuclei and/or the absence of the fluorescent balancer. 
Selected embryos were placed on a yeast-coated apple-juice plate overnight at 
25 °C, and L1 larvae were selected on the following day and placed into vials of 
standard food containing bromophenol blue. Larvae were picked from the vial 
3 days later and tracked. Larvae were tracked individually as they migrated towards 
a single odour source (0.25 M ethyl butyrate; Sigma) and recorded for 5 min witha 
charge-coupled device camera until they reached the odour source or until they 
contacted any of the walls of the apparatus. Images were processed by Ethovision 
software (Noldus). 
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Germline transformation and constructs. UAS-ensHA DNA was constructed by 
PCR-amplifying the full-length cDNA from the ens RA transcript with primers to 
add a haemagglutinin tag to the C-terminal end of the protein followed by an 
EcoRI restriction site. Primers used were 5'-GCCGAATTCCACCATGGCGA 
GTCTTGGGGGC-3’ (5’) and 5’-GCCGAATTCTTATCAAGCGTAATCTGGA 
ACATCGTATGGGTACAGCAGCGATATATCTTITATTTTCGTG-3’ (3’). Amplified 
cDNA was introduced into the Uni-ZAP XR vector (a derivative of pBlueScript 
SK ; Stratagene) using EcoRI/Xhol. This DNA was used by BestGene Inc. to 
generate transgenic flies. 

Statistics. Statistical analysis was performed with Prism (GraphPad Software Inc.). 
Pairwise comparisons were made with Student’s t-test, and group comparisons 
were made by analysis of variance followed by Tukey’s posthoc analysis. In nuclear 
positioning analysis in C2C12 cells, Student’s t-tests were performed between 
scrambled siRNA and experimental condition for each category (aligned, aggre- 
gated and other). 


31. Halfon, M. S. et al. New fluorescent protein reporters for use with the Drosophila 
Gal4 expression system and for vital detection of balancer chromosomes. Genesis 
34, 135-138 (2002). 

32. Lewis, E. Method of Feeding Ethane Methylsulfonate (EMS) to Drosophila Males. 

(Drosophila Information Service, 1968). 

33. Parlakian, A. et al. Skeletal muscle phenotypically converts and selectively inhibits 

metastatic cells in mice. PLoS ONE 5, e€9299 (2010). 

34. DePalma, C. etal. Nitric oxide inhibition of Drp1-mediated mitochondrial fission is 

critical for myogenic differentiation. Cel! Death Differ. 17, 1684-1696 (2010). 

35. Khatau, S.B. eta/. A perinuclear actin cap regulates nuclear shape. Proc. Nat! Acad. 

Sci. USA 106, 19017-19022 (2009). 

36. Brent, J., Werner, K. & Mccabe, B. Drosophila larval NMJ dissection. J. Vis. Exp. 

10.3791/1107 (2009). 

37. Brent,J., Werner, K.& McCabe, B. D. Drosophila larval NMJ immunohistochemistry. 

J. Vis. Exp. (25) e1108, doi:10.3791/1108 (2009). 

38. Richardson, B. E., Beckett, K. & Baylies, M. K. Live imaging of Drosophila myoblast 

fusion. Methods Mol. Biol. 475, 263-274 (2008). 

39. Louis, M., Piccinotti, S.& Vosshall, L.B. High-resolution measurement of odor-driven 

behavior in Drosophila larvae. J. Vis. Exp. (11) e638, doi:10.3791/638 (2008). 


©2012 Macmillan Publishers Limited. All rights reserved 


Lait ELS. 


doi:10.1038/nature10923 


Trans-synaptic Teneurin signalling in neuromuscular 
synapse organization and target choice 


Timothy J. Mosca!, Weizhe Hong"*, Vardhan S. Dani!, Vincenzo Favaloro! & Liqun Luo! 


Synapse assembly requires trans-synaptic signals between the pre- 
and postsynapse’, but our understanding of the essential organiza- 
tional molecules involved in this process remains incomplete’. 
Teneurin proteins are conserved, epidermal growth factor (EGF)- 
repeat-containing transmembrane proteins with large extracellular 
domains’. Here we show that two Drosophila Teneurins, Ten-m and 
Ten-a, are required for neuromuscular synapse organization and 
target selection. Ten-a is presynaptic whereas Ten-m is mostly post- 
synaptic; neuronal Ten-a and muscle Ten-m form a complex in vivo. 
Pre- or postsynaptic Teneurin perturbations cause severe synapse 
loss and impair many facets of organization trans-synaptically and 
cell autonomously. These include defects in active zone apposition, 
release sites, membrane and vesicle organization, and synaptic trans- 
mission. Moreover, the presynaptic microtubule and postsynaptic 
spectrin cytoskeletons are severely disrupted, suggesting a mech- 
anism whereby Teneurins organize the cytoskeleton, which in turn 
affects other aspects of synapse development. Supporting this, 
Ten-m physically interacts with a-Spectrin. Genetic analyses of 
teneurin and neuroligin reveal that they have differential roles that 
synergize to promote synapse assembly. Finally, at elevated 
endogenous levels, Ten-m regulates target selection between specific 
motor neurons and muscles. Our study identifies the Teneurins as a 
key bi-directional trans-synaptic signal involved in general synapse 
organization, and demonstrates that proteins such as these can also 
regulate target selection. 

Vertebrate teneurins are enriched in the developing brain*”, localize 
to synapses in culture’, and pattern visual connections’. Both 
Drosophila Teneurins, Ten-m and Ten-a, function in olfactory synaptic 
partner matching® and were further identified in neuromuscular junc- 
tion (NMJ) defect screens””°®, with Ten-m also affecting motor axon 
guidance’. We examine their roles and the underlying mechanisms 
involved in synapse development. 

Both Ten-m and Ten-a were enriched at the larval NMJ (Fig. 1a and 
Supplementary Fig. 1a). Ten-a was detected at neuronal membranes: 
this staining was undetectable beyond background in ten-a null 
mutants (Supplementary Fig. 1b) and barely detectable after neuronal 
ten-a RNA interference (RNAi; Supplementary Fig. 1c), indicating that 
Ten-a is predominantly presynaptic. Partial co-localization was 
observed between Ten-a and the periactive zone marker Fasciclin 2 
(ref. 12) as well as the active zone marker Bruchpilot’’ (Fig. 1b, c), 
suggesting that Ten-a is localized to the junction between the periactive 
zone and the active zone. Ten-m appeared strongly postsynaptic and 
surrounded each bouton (Fig. la and Supplementary Fig. la, d). 
Muscle-specific ten-m RNAi eliminated the postsynaptic staining, 
but uncovered weak presynaptic staining (Supplementary Fig. le) that 
ubiquitous ten-m RNAi eliminated (Supplementary Fig. 1f). Thus, the 
Ten-m signal was specific and, although partly presynaptic, enriched 
postsynaptically. Consistently, muscle Ten-m colocalized extensively 
with Dlg (Fig. 1d) and completely with «-Spectrin (Fig. le) and is thus 
probably coincident with all postsynaptic membranes. 


The localization of Ten-a and Ten-m suggested their trans-synaptic 
interaction. To examine this, we co-expressed Myc-tagged Ten-a in 
nerves using the Q system’* and haemagglutinin (HA)-tagged 
Ten-m in muscles using GAL4. Muscle Ten-m was able to co- 
immunoprecipitate nerve Ten-a from larval synaptosomes (Fig. 1f), 
suggesting that the Teneurins form a heterophilic trans-synaptic 
receptor pair at the NMJ. 

To determine Teneurin function at the NMJ, we examined the ten-a 
null allele and larvae with neuron or muscle RNAi of ten-a and/or ten-m. 
Following such perturbations, bouton number and size were altered: the 
quantity was reduced by 55% (Fig. 2a—c, g and Supplementary Fig. 2) 


*~ |IB: Myc (Ten-a) 


IB: HA (Ten-m) 


a IB: Brp (Input) 


Figure 1 | Teneurins are enriched at and interact across Drosophila 
neuromuscular synapses. a-e, Representative single confocal sections of 
synaptic boutons stained with antibodies against Ten-a (red) or Ten-m (green), 
horseradish peroxidase (HRP) to mark the neuronal membrane (blue), and a 
synaptic marker as indicated. a, Ten-a is associated with presynaptic 
membranes and Ten-m largely with the surrounding postsynapse (a). b, c, Ten- 
a shows limited co-localization with the periactive zone marker Fasciclin 2 
(b), and Bruchpilot (Brp), an active zone marker (c). d, e, Ten-m co-localizes 
with, and extends beyond, Dlg (d) and completely co-localizes with muscle 
a-Spectrin (e). f, immunoblots (IB) of larval synaptosomes expressing neuronal 
Flag-Myc-tagged Ten-a (N) and muscle Flag~HA-tagged Ten-m (M) and 
immunoprecipitated (IP) using antibodies to HA. Ten-a is detected in the pull- 
down, indicating that nerve Ten-a and muscle Ten-m interact across the NMJ. 
This is not seen in control lanes. Owing to low expression, neither transgene 
product is detectable in input lysates, which are enriched in Brp. 

Scale bar, 5 um. 
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Figure 2 | Teneurins affect the structure and function of the neuromuscular 
synapse. a-f, Representative NMJs stained with antibodies to HRP. Muscle- 
specific ten-m RNAi (M-IR) and lots of ten-a decrease bouton number. 
Neuronal (Ten-a N) but not muscle (Ten-a M) restoration of ten-a expression 
rescues this phenotype. These defects resemble n/g1 mutants and are enhanced 
in ten-a nlg1 double mutants. g, Quantification of bouton number. 

h-j, Representative NMJs stained with antibodies to Brp (green), the glutamate 
receptor subunit DGluRIII (red) and HRP (blue). In control larvae (h), Brp and 
DGluRIII puncta properly appose. teneurin perturbations (i, j) disrupt this 
active zone (yellow arrowhead) and glutamate receptor apposition (white 
arrowhead). k, Quantification of unapposed active zone/glutamate receptor 
pairs. For each quantification, n = 8 larvae, 16 NMJs. l-q, Transmission 
electron microscopy of active zone T-bars (asterisks) in control larvae (I) and 


and the incidence of large boutons markedly increased (Supplemen- 
tary Fig. 2k). Both changes indicate impaired synaptic morphogenesis. 
The reduction in bouton number was probably cumulative through 
development, as it was visible in first instar ten-a mutants and persisted 
(Supplementary Fig. 2k). In the ten-a mutant, bouton morphogenesis 
was rescued by restoring Ten-a expression in neurons, but not muscles 
(Fig. 2d, g and Supplementary Fig. 2). Neuronal Ten-m overexpres- 
sion could not substitute for the lack of Ten-a, revealing their non- 
equivalence (Supplementary Fig. 2e, 1). Neuronal knockdown of Ten-a 
or Ten-m resulted in fewer synaptic boutons (Supplementary Fig. 2f- 
h, 1), indicating that both have a presynaptic function, although pre- 
synaptic Ten-a has a more predominant role (Supplementary Fig. 21). 
Moreover, knocking down postsynaptic Ten-m in the ten-a mutant 
did not enhance the phenotype (Fig. 2g). Thus, presynaptic Ten-a 
(and, to a lesser extent, Ten-m) and postsynaptic Ten-m are required 
for synapse development. 

teneurin perturbation also caused defects in the apposition between 
presynaptic active zones (release sites) and postsynaptic glutamate 
receptor clusters'> (Fig. 2h and Supplementary Fig. 3): up to 15% of 
the active zones/receptor clusters lacked their partner compared to 
1.8% in controls (Fig. 2h-k). Under electron microscopy, active zones 
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ten-a mutants (m-q) showing double T-bars (m), detached T-bars 

(n), misshapen T-bars (0), membrane ruffling (p, waved arrows) and T-bars 
facing contractile tissue (q). Some images show multiple defects. r, Distribution 
of T-bar defects as a percentage of the total T-bars. N-IR, neuronal RNAi. For 
each genotype, n = 3 larvae, 40 boutons. s, t, Representative evoked EPSP 

(s) and mEPSP (t) traces from control and ten-a mutant genotypes. 

u, Quantification of mean EPSP amplitude (black), mEPSP amplitude (red), 
mEPSP frequency (blue) and quantal content (QC, white) expressed as a 
percentage of the control average. For all genotypes, n = 7 larvae, 8 muscles. 
Error bars represent s.e.m. Scale bars, 10 [1m (a-f), 5 um (h-j), 100 nm (I- 

q). ***P < 0.001, **P < 0.01, *P <0.05, NS, not significant. Statistical 
comparisons are with control unless noted. 


are marked by electron-dense membranes and single presynaptic 
specializations called T-bars (Fig. 21), which enable synapse assembly, 
vesicle release and Ca”’-channel clustering’. Teneurin disruption 
caused defects (Fig. 2m-r and Supplementary Fig. 3) in T-bar ultrastruc- 
ture (Fig. 2m-o), membrane organization, and apposition to contractile 
tissue (Fig. 2p, q). Teneurin perturbation also impaired postsynaptic 
densities while increasing membrane ruffling (Supplementary Table 1), 
further indicating organizational deficiency. These phenotypes 
resemble mutants with adhesion and T-bar biogenesis defects’””*, 
suggesting a role for Teneurins in synaptic adhesion and stability. 
Synaptic vesicle populations similarly required Teneurins for cluster- 
ing at the bouton perimeter and proper density (Supplementary Fig. 4). 
As these effects are not synonymous with active zone disruption’, 
Teneurins are also required for synaptic vesicle organization. 
Synapses lacking teneurin were also functionally impaired. The mean 
amplitude of evoked excitatory postsynaptic potentials (EPSPs) in larvae 
was decreased by 28% in the ten-a mutant (Fig. 2s, u). Spontaneous 
miniature EPSPs showed a 20% decrease in amplitude, a 46% decrease in 
frequency (Fig. 2t, u), and an altered amplitude distribution compared 
with control (Supplementary Fig. 5a). These defects resulted in a 20% 
reduction in quantal content (Fig. 2u), which could be partly due to 
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fewer boutons and release sites. However, release probability may also be 
reduced, as suggested by an increased paired pulse ratio in ten-a mutants 
(Supplementary Fig. 5d, e). The decay kinetics of responses were faster 
in ten-a mutants, suggesting additional postsynaptic effects on glutam- 
ate receptors and/or intrinsic membrane properties (Supplementary 
Fig. 5b, c). Further, FM1-43 dye loading revealed markedly defective 
vesicle cycling in ten-a mutants (Supplementary Fig. 5f, h). Consistent 
with physiological impairment, teneurin-perturbed larvae exhibited 
profound locomotor defects (Supplementary Fig. 5i). In summary, 
Teneurins are required for multiple aspects of NMJ organization and 
function. 

As a potential mechanism for synaptic disorganization following 
teneurin perturbation, we examined the pre- and postsynaptic 
cytoskeletons. In the presynaptic terminal, organized microtubules 
contain Futsch (a microtubule-binding protein)-positive ‘loops’, 
whereas disorganized microtubules possess punctate, ‘unbundled’ 
Futsch”’. Each classification normally represented ~ 10% (often distal) 
of boutons (Fig. 3a, d and Supplementary Fig. 6). Upon teneurin per- 
turbation, many more boutons had unbundled Futsch (Fig. 3b, c and 
Supplementary Fig. 6) whereas those with looped microtubules were 
decreased by 62-95% (Fig. 3d). Therefore, proper microtubule organ- 
ization requires pre- and postsynaptic Teneurins. In contrast to mild 
active zone/glutamate receptor apposition defects, most boutons dis- 
played microtubule organizational defects. 

teneurin perturbation also severely disrupted the postsynaptic spectrin 
cytoskeleton, with which Ten-m colocalized (Fig. le). Postsynaptic 
a-Spectrin normally surrounds the bouton (Fig. 3e). Perturbing neuronal 
or muscle Teneurins markedly reduced postsynaptic o-Spectrin without 
affecting Dlg (Fig. 3f-h and Supplementary Fig. 7). Postsynaptic 
B-Spectrin”’, Adducin” and Wsp were similarly affected (Supplemen- 
tary Fig. 8). In muscle, «-Spectrin is coincident with and essential for 
the integrity of the membranous subsynaptic reticulum (SSR)**”. 
Consistent with this, teneurin disruption reduced SSR width up to 
70% (Supplementary Fig. 9d—g) and increased the frequency of ‘ghost’ 
boutons, which are failures of postsynaptic membrane organization” 
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Figure 3 | Teneurin perturbation results in marked cytoskeletal 
disorganization. a—c, Representative NMJs stained with antibodies to Futsch 
(green) and HRP (magenta). Arrowheads indicate looped organization. Arrows 
indicate unbundled Futsch. d, Quantification of the percentage of total boutons 
with looped or unbundled microtubules. e-g, Representative NMJs stained 
with antibodies to «-Spectrin (green), Dlg (red) and HRP (blue). Following 
teneurin perturbation, o-Spectrin staining is largely lost. Axonal o-Spectrin is 
unaffected by muscle teneurin RNAi (f). h, Quantification of «-Spectrin (green) 
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(Supplementary Fig. 9a—d). Thus, Teneurins are involved in the organ- 
ization of the pre- and postsynaptic cytoskeletons and postsynaptic 
membranes. Further, endogenous «-Spectrin co-immunoprecipitated 
with muscle-expressed, Flag-tagged Ten-m (Fig. 3i), suggesting that 
Ten-m physically links the synaptic membrane to the cytoskeleton. 

Because the most severe defects following teneurin perturbation 
were cytoskeletal, we propose that Teneurins primarily organize the 
presynaptic microtubule and postsynaptic spectrin-based cytoskeletons 
(Fig. 3j), which then organize additional synaptic aspects”’*'. However, 
such a solitary role cannot fully explain the observed phenotypes. The 
reduction in bouton number associated with cytoskeletal disruption is 
milder than that following teneurin disruption”*'**. Also, although 
active zone dynamics are affected by cytoskeletal perturbation”', defects 
in apposition are not*’”*. Moreover, the T-bar structural defects more 
closely resemble synapse adhesion and active zone formation 
defects'”"*. Thus, Teneurins may regulate release site organization 
and synaptic adhesion independent of the cytoskeleton (Fig. 3)). 

Our data also indicate that Teneurins act bi-directionally across the 
synaptic cleft. Ten-a acts predominantly in neurons, as evidenced by 
localization, phenotypes caused by neuronal (but not muscle) knock- 
down, and mutant rescue by neuronal (but not muscle) expression 
(Figs 2 and 3 and Supplementary Figs 2-4, 6, 7 and 9). Yet, in addition 
to the presynaptic phenotypes, many others were postsynaptic, includ- 
ing reduced muscle spectrin, SSR, and membrane apposition (Fig. 3 
and Supplementary Figs 7-9). Similarly, although Ten-m is present 
both pre- and postsynaptically, muscle knockdown resulted in pre- 
synaptic defects, including microtubule and vesicle disorganization, 
reduced active zone apposition, and T-bar defects (Figs 2 and 3 and 
Supplementary Figs 3, 4, 6 and 7). Thus, Teneurins function in 
bi-directional trans-synaptic signalling to organize neuromuscular 
synapses. This may involve downstream pathways or simply establish 
an organizational framework by the receptors themselves. Moreover, 
as the results of single disruptions of neuronal ten-a or muscle ten-m 
were similarly severe and not enhanced by combination (Figs 2g and 
3d, h and Supplementary Fig. 2k), both Ten-a and Ten-m probably 
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and Dlg (red) fluorescence. A.U., arbitrary units. For all genotypes, n = 6 larvae, 
12 NMJs. i, Immunoblots (IB) showing that «-Spectrin is detected in the Flag 
immunoprecipitates (IP) of larvae expressing muscle Flag-HA-tagged Ten-m 
but not in control larvae. Owing to low expression, Flag~HA-Ten-m is only 
detectable after enrichment by immunoprecipitation. j, Model showing the 
roles of Teneurins, Neurexin and Neuroligin at the NMJ. Arrow size represents 
the relative contribution of each pathway to the cellular process as inferred from 
mutant phenotypic severity. Scale bars, 5 jim. ***P < 0.001, NS, not significant. 
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function in the same pathway. Our finding that Ten-a and Ten-m 
co-immunoprecipitate from different cells in vitro’ and across the 
NMJ in vivo (Fig. 1f) further suggests a signal via a trans-synaptic 
complex. Teneurin function, however, may not be solely trans- 
synaptic. In some cases (vesicle density, SSR width), cell-autonomous 
knockdown resulted in stronger phenotypes than knocking down in 
synaptic partners (Supplementary Figs 3, 4, 9 and Supplementary 
Table 1). This suggests additional cell-autonomous roles in addition 
to trans-synaptic Teneurin signalling. 

Signalling involving the transmembrane proteins Neurexin and 
Neuroligin also mediates synapse development”. In Drosophila, Neurexin 
(nrx) and Neuroligin 1 (nlg1) mutations cause phenotypes similar to 
teneurin perturbation: reductions in bouton number, active zone organ- 
ization, transmission, and SSR width”””. nlg1 and nrx mutations do not 
enhance each other, suggesting that they function in the same path- 
way”*. Consistently’”*, we found that nrx and nlg] mutants exhibited 
largely similar phenotypes (data not shown). To investigate the rela- 
tionship between the teneurin genes and nrx and nlg1, we focused on 
the nig1 null mutant. Both Nlg1 tagged with enhanced green fluor- 
escent protein (Nlgl-eGFP) and endogenous Ten-m occupied a 
similar postsynaptic space (Supplementary Fig. 10a). teneurin and 
nigl loss-of-function mutations also displayed similar bouton number 
reductions (Fig. 2e, g), vesicle disorganization (Supplementary Fig. 4), 
and ghost bouton frequencies (Supplementary Fig. 9). Other pheno- 
types showed notable differences in severity. In nig] mutants, there was 
a 29% failure of active zone/glutamate receptor apposition (Fig. 2k and 
Supplementary Fig. 10d), compared to 15% for the strongest teneurin 
perturbation. The cytoskeleton of nlg1 mutants, however, was only 
mildly impaired compared to that seen with teneurin perturbations 
(Fig. 3d, h and Supplementary Figs 6 and 7). 

To examine further the interplay of feneurin and nlgl, we analysed 
ten-a nlg1 double mutants. Both single mutants were viable, despite 
their synaptic defects. Double mutants, however, were larval lethal. We 
obtained rare escapers, which showed a 72% reduction in boutons, 
compared to a 50-55% decrease in single mutants (Fig. 2e). Active 
zone apposition in double mutants was enhanced synergistically over 
either single mutant (Fig. 2k and Supplementary Fig. 10e). Cytoskeletal 
defects in the double mutant resembled the ten-a mutant (Fig. 3 and 
Supplementary Figs 6 and 7). These data suggest that teneurin genes 
and nrx and nig] act in partially overlapping pathways, cooperating to 
organize synapses properly, with Teneurins contributing more to 
cytoskeletal organization and Neurexin and Neuroligin to active zone 
apposition (Fig. 3)). 

In the accompanying manuscript®, we showed that although the basal 
Teneurins are broadly expressed in the Drosophila antennal lobe, ele- 
vated expression in select glomeruli mediates olfactory neuron partner 
matching. At the NMJ, this basal level mediates synapse organization. 
Analogous to the antennal lobe, we found elevated ten-m expression at 
muscles 3 and 8 using the ten-m-GAL4 enhancer trap (Fig. 4a). We 
confirmed this for endogenous ten-m, and determined that it was con- 
tributed by elevated Ten-m expression in both nerves and muscles 
(Fig. 4b-g). Indeed, ten-m-GAL4 was highly expressed in select motor 
neurons, including MN3-Ib, which innervates muscle 3 (ref. 29; 
Supplementary Fig. 11c). This elevated larval expression also varied 
along the anterior—posterior axis (Supplementary Fig. 12), and was 
specific for Ten-m, as Ten-a expression did not differ within or between 
segments (data not shown). 

To test whether elevated Ten-m expression in muscle 3 and MN3-Ib 
affects neuromuscular connectivity, we expressed ten-m RNAi using 
ten-m-GAL4, Wild-type muscle 3 was almost always innervated 
(Fig. 4h). However, after ten-m knockdown, muscle 3 innervation 
failed in 11% of hemisegments (Fig. 4i, j). This required Ten-m on 
both sides of the synapse, as the targeting phenotype persisted follow- 
ing neuronal or muscle RNAi suppression using tissue-specific GAL80 
transgenes (Fig. 4j). ten-a RNAi did not show this phenotype (Fig. 4j), 
consistent with homophilic target selection via Ten-m. The phenotype 
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Figure 4 | High-level Ten-m expression regulates muscle target selection. 
a, Representative images of hemisegment A3 stained with antibodies to Dlg 
(blue), phalloidin (red), and expressing GFP via ten-m-GAL4 (green). High-level 
expression is observed in muscles 3 and 8 and basally in all muscles. b, c, Muscle 3 
(b) and 4 (c) NMJs show differential Ten-m (red) but similar Synaptotagmin 1 
(Syt1; green) expression (from a ten-m muscle knockdown animal). 

d, Quantification of presynaptic Ten-m (red) and Syt1 (green) fluorescence at 
muscle 3 and 4 NMJs. MN, motor neuron. e, f, NMJs at muscles 3 (e) and 4 
(f) show differential Ten-m (red) but similar Syt1 (green) expression in muscles 
(from a ten-m nerve knockdown). g, Quantification of postsynaptic Ten-m (red) 
and Syt1 (green) fluorescence at muscle 3 and 4 NMJs. h, i, Representative 
images stained with phalloidin (blue) and antibodies to HRP (green) and Dlg 
(red) to visualize motor neurons and muscles in control (h) or ten-m- 
GAL4>ten-m RNAi larvae (i). m, muscle. j, Quantification of the hemisegment 
percentage with failed muscle 3 (red), 4 (black) or 2 (blue) innervation. IR, 
interfering RNA. k, 1, Representative images of the muscle 6/7 NMJ labelled with 
antibodies to Dg (green) and HRP (magenta). The characteristic wild-type 
arrangement of boutons (k) is shifted towards muscle 6 when Ten-m is 
overexpressed in that muscle and the innervating motor neurons 

(1). m, Quantification of the total bouton percentage on muscles 6 (blue) and 7 
(red). All genotypes contain H94-GAL4, additional transgenes are indicated (for 
details, see Methods). The Ten-m-mediated shift is abolished by neuronal or 
muscle GAL80 transgenes. Scale bars, 100 1m (a), 5 um (b-i), 10 pm (k, 1). In all 
cases, n= 12 larvae. ***P < 0.001, **P < 0.01, NS, not significant. 


was specific to muscle 3, as innervation onto the immediately proximal 
or distal muscle was unchanged (Fig. 4j). The low penetrance is probably 
due to redundant target selection mechanisms”. Where innervation 
did occur, the terminal displayed similarly severe phenotypes to other 
NMJs (not shown). Thus, in addition to generally mediating synaptic 
organization, Ten-m also contributes to correct target selection at a 
specific NMJ. 

To determine whether Ten-m overexpression could alter connec- 
tivity, we expressed Ten-m in muscle 6 (but not the adjacent muscle 7), 
and the motor neurons innervating both muscles using H94-GAL4. 
Normally, 60% of the boutons at muscles 6/7 are present on muscle 6 
with 40% on muscle 7 (Fig. 4k, m). Ten-m overexpression caused a 
shift whereby 81% of boutons synapsed onto muscle 6 and only 19% 
onto muscle 7 (Fig. 41, m). This shift also required both neuronal and 
muscle Ten-m, as neuronal or muscle GAL80 abrogated it (Fig. 4m). 
The effect was specific because Ten-a overexpression did not alter this 
synaptic balance (Fig. 4m), nor was it secondary to altered bouton 
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number, which was unchanged (data not shown). Therefore, elevated 
Ten-m on both sides of the NMJ can bias target choice. This, combined 
with evidence that Ten-m can mediate homophilic interaction in vitro’, 
supports a trans-synaptic homophilic attraction model at the NMJ as 
in the olfactory system. 

We identified a two-tier mechanism for Teneurin function in synapse 
development at the Drosophila NMJ. At the basal level, Teneurins are 
expressed at all synapses and engage in hetero- and homophilic 
bi-directional trans-synaptic signalling to organize synapses properly 
(Fig. 3j). Supporting this, Teneurins can mediate homo- and hetero- 
philic interactions in vitro’ and heterophilic interactions in vivo 
(Fig. 1f). At the synapse, Teneurins organize the cytoskeleton, interact 
with «-Spectrin, and enable proper adhesion and release site forma- 
tion. Further, elevated Ten-m expression regulates target selection in 
specific motor neurons and muscles via homophilic matching and 
functions with additional molecules” to mediate precise neuromuscular 
connectivity. Teneurin-mediated target selection at the NMJ is ana- 
logous to its role in olfactory synaptic partner matching®. As Teneurins 
are expressed broadly throughout the antennal lobe, it remains an 
attractive possibility that they also regulate synapse organization in 
the central nervous system. 


METHODS SUMMARY 


Details of Drosophila stocks, immunostaining, electron microscopy, functional 
assays, construction of epitope-tagged Teneurin constructs, immunoprecipitation, 
imaging and statistical analysis can be found in Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Drosophila stocks. All Drosophila strains and controls were raised at 29 °C to 
maximize GAL4 expression. All mutants and transgenes were maintained over 
GFP balancer chromosomes to enable larval selection. Mhc-GAL4 or Mef2- 
GALA4 (ref. 31) was used to drive expression in all somatic muscles. Nrv2-GAL4 
(ref. 32) and elav-GAL4 (ref. 33) were used to drive expression in all neurons. H94- 
GAL4 was used to drive expression in muscles 6, 13 and 4 and their corresponding 
motor neurons™. daughterless-GAL4 was used to drive expression ubiquitously”. 
Synj-QF*® was used to drive expression in all nerves. NP6658-GAL4 (ten-m-GAL4) 
was used to drive expression in the pattern of endogenous Ten-m expression®. The 
Df(X)ten-a deletion was used as a ten-a null mutant. For n/g1 mutants, the 1960 
and ex2.3 alleles were used in trans** and double mutant larvae with ten-a and nlg1 
mutations were obtained using optimized rearing conditions’. Because of the 
early lethality of the ten-m mutant'’, and to assess independently ten-a, tissue- 
specific RNAi was used to examine teneurin perturbation using the following 
RNAi transgenic strains: for ten-m, UAS-ten-m®N4-Y51173 and for ten-a (ref. 8), 
UAS-ten-a®“* V4? The following transgenic strains were used: UAS-Dcr2 
(ref. 38), UAS-Fas2 (ref. 34), UAS-mCD8GFP (ref. 39), UAS-Nigl-eGEP 
(ref. 28), UAS-Ten-a (ref. 8), P{GS}9267 for ten-m overexpression*. In all cases, 
the efficacy of RNAi transgenes, overexpression transgenes and the ten-a deletion 
mutant were assessed and verified by the alteration of antibody staining (loss, 
reduction or increase) using tissue-specific GAL4 drivers. For all cases, N-IR 
indicates neuronal RNAi, M-IR indicates muscle RNAi, U-IR indicates ubiquitous 
RNAi. For rescuing ten-a mutants, Ten-a N indicates neuronal overexpression 
with elav-GAL4 of UAS-ten-a and Ten-a M indicates muscle overexpression of 
UAS-ten-a with Mef2-GAL4. 

Immunostaining. Wandering third instar larvae were processed as previously 
described”. The following primary antibodies were used: mouse antibody to 
Ten-m (mAb20, 1:500)*°, guinea pig antibody to Ten-a (1:100)*’, mouse antibody 
to Brp (mAbnc82, 1:250)”, rabbit antibody to Synaptotagmin 1 (1:4,000)*, mouse 
antibody to Cysteine String Protein (mAb6D6, 1:100)**, mouse antibody to Dlg 
(mAb4F3, 1:500)*, rabbit antibody to Dlg (1:40,000)**, mouse antibody to 
a-Spectrin (mAb3A9, 1:50)"”, mouse antibody to Fasciclin 2 (mAb1D4, 1:20)”, 
rabbit antibody to Fasciclin 2 (1:5,000)*°, mouse antibody to Futsch (mAb22C10, 
1:50), rabbit antibody to DGluRIII (1:2,500)'*, rat antibody to Elav 
(mAb7E8A10, 1:25), mouse antibody to Even-skipped (mAb3C10, 1:100)°°, 
rabbit antibody to B-Spectrin (1:1,000)*', mouse antibody to Hts (1:500)”, guinea 
pig antibody to Wsp (1:1,000)**. Alexa488-, Alexa546- or Alexa647-conjugated 
secondary antibodies were used at 1:250 (Invitrogen). Texas-Red-conjugated 
Phalloidin was used at 1:300. FITC-, Cy3- or Cy5-conjugated antibodies to HRP 
were used at 1:100 (Jackson ImmunoResearch). 

Electron microscopy. Wandering third instar larvae were processed and sec- 
tioned as described”*. Sections were imaged on a JEM-1400 (JEOL) transmission 
electron microscope at 3,000 to *20,000 magnification. 

Electrophysiology. Larvae were dissected in HL3 saline® containing 0 mM Ca** 
and 4mM Mg”". They were then transferred to saline containing 0.6 mM Ca** 
and recordings conducted by impaling larval muscle 6 in body wall segments A3 
and A4 using sharp intracellular electrodes (10-20MQ), fabricated from 
borosilicate glass capillaries and filled with 3 M KCl solution. For evoked EPSPs, 
severed nerve bundles were stimulated using a suction electrode connected to a 
linear stimulus isolated (A395, World Precision Instruments). Data, acquired 
using Multiclamp 700B amplifiers (Molecular Devices), were low-pass filtered at 
3 kHz and digitized at 10 kHz. Recordings were acquired and analysed using Igor 
Pro software (Wavemetrics) and custom-written programs. All recordings in 
which the resting membrane potential was higher than —60 mV and/or whose 
resting potential, input resistance or access resistance changed by more than 20% 
during the duration of data acquisition were excluded from analysis. All recordings 
and data analyses were performed blind to the genotype. Quantal content was 
corrected for nonlinear summation”. 

Larval locomotion. Crawling assays were conducted as described”. 

FM1-43 dye loading. FM1-43 (Invitrogen) dye loading was conducted as 
described*® with the following modifications: loading was conducted in 1.5mM 
Ca?*, 90 mM K% saline for 1 min followed by six 2-min washes in 0 mM Ca?* 
saline. Imaging was conducted on a Zeiss LSM 510 Meta Confocal (Carl Zeiss) 
with a X40, PlanApo NA 1.0 water immersion lens (Carl Zeiss). 

Construction of epitope-tagged teneurin transgenes. The ten-m and ten-a 
coding sequences lacking the stop codon were cloned into pENTR-D/TOPO 
(Invitrogen) from pENTR-ten-m and pENTR-ten-a*. pENTR-ten-m(-stop) and 
pENTR-ten-a(-stop) were recombined into the destination vectors pUASTattB- 
gtw-tFHAH and pQUASTattB-gtw-tFMH, respectively, using LR Clonase II 
(Invitrogen). pUASTattB-gtw-tFHAH is a pUASt-Gateway-attB based vector with 
a C-terminal TEV recognition site and 3XFlag, 3XHA and 10XHis tags. 
PQUASTattB-gtw-tFMH is a pQUASt-Gateway-attB’ based vector with a 


C-terminal TEV recognition site and 3xFlag, 6<Myc and 10XHis tags. The 
resulting constructs were verified by restriction digest and sequencing and inte- 
grated into the attP24 or 86Fb landing sites on the second and third chromo- 
somes”. Transgenic flies were verified by immunoprecipitation on western blot 
and overexpression experiments. 

Immunoprecipitation, western blots and SDS-PAGE analysis. For Ten-m and 
Ten-a, QUAS-Ten-a-Flag-Myc was expressed in nerves using Synj-QF and UAS- 
Ten-m-Flag-HA in muscles using mhc-GAL4. Larval synaptosomes were prepared 
from larval body wall fillets as described**. For Ten-m and «-Spectrin, control 
larvae consisted of Mef2-GAL4 without UAS-Ten-m-Flag-HA whereas experi- 
mental flies combined the two. Immunoprecipitation was conducted as described 
using M2-anti-Flag-conjugated agarose (Sigma) or Affi-Prep Protein A beads 
(Bio-Rad) and rat antibodies to HA (Roche)”. Proteins were separated on 
NuPAGE 3-8% Tris-Acetate Gels (Invitrogen) and transferred to nitrocellulose. 
Primary antibodies were applied overnight at 4°C and secondary antibodies at 
21°C for 1h. The following primary antibodies were used: mouse antibody to 
ot-Spectrin (mAb3A9, 1:2,000), mouse antibody to Brp (mAbnc82, 1:100), mouse 
antibody to Flag (M2, 1:5,000, Sigma-Aldrich), mouse antibody to Myc (3E10, 
1:1,500, Santa Cruz Biotechnology), rat antibody to HA (3F10, 1:1,500, Roche). 
HRP-conjugated secondary antibodies (Jackson ImmunoResearch) were used at 
1:10,000. Blots were developed using the SuperSignal West Femto Maximum 
Sensitivity Substrate (ThermoScientific). 

Imaging analysis. Larvae were imaged with a Zeiss LSM 510 Meta laser-scanning 
confocal microscope (Carl Zeiss) using either a X63 1.4 NA or a X40 1.0 NA 
objective. NMJ images were taken as confocal z-stacks with the upper and lower 
bounds defined by HRP staining unless otherwise noted. For all metrics, boutons 
were assessed in segment A3 at muscle 6/7 and muscle 4 on both the left and right 
sides. Fluorescence intensity measurements were taken from terminals on muscle 
4. All phenotypes, however, were observed at all synapses regardless of muscle 
fibre or segment. For membrane organization, vesicle distribution and Teneurin 
colocalization, NMJ images were taken as single optical sections at the precise 
centre of the bouton as determined by HRP staining. Images were processed with 
the LSM software and Adobe Photoshop CS4. Bouton number, active zone/ 
glutamate receptor apposition, fluorescent intensity and microtubule organization 
were quantified as previously described”’. Targeting errors for each larva were quan- 
tified as the percentage of hemisegments from Al to A7 in a single animal with a 
failure of target innervation. There was no difference in targeting errors based on 
body wall segment. Experiments using H94-GAL4 were conducted as described™, 
and their effects confirmed using Fasciclin 2 overexpression (control = 58.1% of 
boutons on muscle 6, 41.9% on muscle 7; Fas 2 overexpression = 73.0% on muscle 
6, 27.0% on muscle 7; n = 8 animals for each, P< 0.0001)**. 

In electron micrographs, parameters were quantified as previously described 
using ImageJ (NIH)**. T-bar defects were classified into one of five categories: 
normal (no discernible defect), double (two T-bars were observed in the same, 
continuous active zone), detached (where the T-bar was clearly visible but was not 
explicitly connected to the membrane associated with the nearest PSD), apposite 
contractile tissue (where the T-bar was not apposed to the SSR, but rather, the 
contractile tissue of the muscle), misshapen (where an electron-dense T-bar was 
visible but did not conform to the ‘T’ shape. Often, the T-bars were ‘X’ shaped). For 
Fig. 2r, each defect is expressed as a percentage of the total number of T-bars 
observed in a particular genotype. 

For Ten-m gradient calculation, single optical sections were taken through the 
centre of the NMJ on muscle 3 or muscle 4, as determined by HRP immunoreacti- 
vity. The GFP signal (ten-m-GAL4) or antibody signal was then measured using 
Image] (NIH). For each larva, measurements were taken on the right and left sides 
of each indicated segment. The fluorescence for each segment was expressed as a 
percentage of the fluorescence from segment A1 in the same animal, on the same 
side of the larvae. For all larvae, segment A1 represented the maximal fluorescence. 
Statistical analysis. Statistical analysis used GraphPad Prism 5 (Graphpad 
Software). In all cases involving more than two samples, significance was calcu- 
lated using ANOVA followed by a Dunnett post-hoc test to the control sample and 
a Bonferroni post-hoc test among all samples. For two-sample cases, an unpaired 
Student’s t-test was used to assess significance, unless otherwise indicated. In all 
cases, both methods provided similar significance measurements. In all figures, 
significance is with respect to control genotypes unless otherwise noted. 
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Teneurins instruct synaptic partner 
matching in an olfactory map 


Weizhe Hong", Timothy J. Mosca! & Liqun Luo? 


Neurons are interconnected with extraordinary precision to assemble a functional nervous system. Compared to axon 
guidance, far less is understood about how individual pre- and postsynaptic partners are matched. To ensure the proper 
relay of olfactory information in the fruitfly Drosophila, axons of ~50 classes of olfactory receptor neurons (ORNs) form 
one-to-one connections with dendrites of ~50 classes of projection neurons (PNs). Here, using genetic screens, we 
identified two evolutionarily conserved, epidermal growth factor (EGF)-repeat containing transmembrane Teneurin 
proteins, Ten-m and Ten-a, as synaptic-partner-matching molecules between PN dendrites and ORN axons. Ten-m and 
Ten-a are highly expressed in select PN-ORN matching pairs. Teneurin loss- and gain-of-function cause specific 
mismatching of select ORNs and PNs. Finally, Teneurins promote homophilic interactions in vitro, and Ten-m 
co-expression in non-partner PNs and ORNs promotes their ectopic connections in vivo. We propose that Teneurins 
instruct matching specificity between synaptic partners through homophilic attraction. 


The chemoaffinity hypothesis was proposed nearly 50 years ago to 
explain the target specificity of regenerating optic nerves: developing 
neurons “must carry individual identification tags, presumably cyto- 
chemical in nature, by which they are distinguished one from another 
almost, in many regions, to the level of the single neuron”’. Many 
molecules are now known that guide axons to their target areas’, 
but few may mediate mutual selection and direct matching between 
individual pre- and postsynaptic partners. Here we show that the 
transmembrane Teneurin proteins instruct the selection of specific 
synaptic partners in the Drosophila olfactory circuit (Supplementary 
Fig. 1). 

In Drosophila, individual classes of ORN axons make one-to-one 
connections with individual classes of second-order PN dendrites 
within one of ~50 discrete glomeruli in the antennal lobe. We refer 
to this specific one-to-one connection as PN-ORN synaptic partner 
matching. Olfactory circuit assembly takes place in sequential steps 
before sensory activity begins*®. PN dendrites first elaborate within 
and pattern the developing antennal lobe’, which is followed by 
invasion of ORN axons’* “. Importantly, re-positioning PN dendrites 
redirects their partner ORN axons without disrupting the connec- 
tions'’, suggesting that proper PN-ORN connections probably 
involve direct recognition and matching between partners. 


Matching screens identified Ten-m and Ten-a 

To identify potential PN-ORN matching molecules, we simultaneously 
labelled select PN dendrites and ORN axons in two colours and per- 
formed two complementary genetic screens (Fig. 1a, d). We overex- 
pressed 410 candidate cell-surface molecules, comprising ~40% of the 
potential cell-recognition molecules in Drosophila’’. In the first screen, 
we used Mz19-GAL4 to label DA1, VAld and DC3 PNs (hereafter 
Mz19 PNs), and Or47b-rCD2 to label Or47b ORNs (Fig. 1a, b). 
Or47b ORN axons normally project to the VAllm glomerulus and 
are adjacent to Mz19 PN dendrites without overlap. We overexpressed 
candidate cell-surface molecules only in Mz19 PNs to identify those that 
promoted ectopic connections between Or47b axons and Mz19 den- 
drites (Fig. la). We found that overexpression of ten-m (P{GS}9267; 
Supplementary Fig. 2b) produced ectopic connections (Fig. 1c). 


In the second screen, we labelled Mz19 PNs as above and Or88a 
ORNs using Or88a-rCD2 (Fig. 1d, e). Or88a ORN axons normally 
project to the VAld glomerulus, intermingling extensively with 
VA1d PN dendrites (Fig. le). We overexpressed candidate cell-surface 
molecules in Mz19 PNs (Fig. 1d) as above and found that overexpres- 
sion of ten-a (P{GE}1914, Supplementary Fig. 2a) partially disrupted 
the intermingling of Or88a axons and Mz19 dendrites (Fig. 1f). 

In addition to impairing PN-ORN matching, ten-m and ten-a over- 
expression shifted Mz19 PN dendrite position (Fig. 1c, f). However, 
mismatching was not a secondary consequence of axon or dendrite 
mispositioning; mispositioning alone, caused by perturbation of 
other genes, does not alter PN-ORN matching”’*”*. Furthermore, 
among 410 candidate molecules, only ten-m and ten-a overexpression 
exhibited mismatching defects, suggesting their specificity in PN- 
ORN matching. 

Both ten-m and ten-a appear to encode type II transmembrane 
proteins’”". They possess highly similar domain compositions and 
amino acid sequences; each contains eight EGF-like and multiple YD 
(tyrosine-aspartate) repeats within its large carboxy-terminal extra- 
cellular domain (Fig. 1g). Ten-m and Ten-a were initially identified 
as tenascin-like molecules”®”!, but vertebrate teneurins were later 
identified as their true homologues based on sequence and domain 
similarity (Fig. 1h). Thus, we refer to Ten-m and Ten-a as Drosophila 
Teneurins. Teneurins are present in nematodes, flies and vertebrates. 
In human, teneurin-1 and teneurin-2 are located in chromosomal 
regions associated with intellectual disability’, and teneurin-4 is 
linked to susceptibility to bipolar disorder”. 

Drosophila ten-m was originally identified as a pair-rule gene 
required for embryonic patterning”’”’, but this function was recently 
shown to be unrelated to ten-m**. Teneurins were implicated in 
synapse development at the neuromuscular junction'®”* (see ref. 26), 
and Ten-m also regulates motor axon guidance™*. Neither the under- 
lying mechanisms nor their potential roles in the central nervous system 
are known. Vertebrate teneurins are widely expressed in the nervous 
system’*’” and interact homophilically in vitro***’, suggesting their 
potential role as homophilic cell adhesion molecules in patterning 
neuronal connectivity. 
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Figure 1 | PN-ORN synaptic matching screens identify two Teneurins. 

a, d, Schematics showing two PN-ORN matching screens. PN dendrites are 
labelled by Mz19-GAL4 driving mCD8GFP and ORN axons by Or47b-rCD2 
(a) or Or88a-rCD2 (d). Each EP line has a transposable element insertion that 
places UAS 5’ to a gene encoding a predicted cell-surface protein, which can be 
overexpressed using Mz19-GAL4. b, c, Or47b axons and Mz19 dendrites do not 
overlap in control (b), but form ectopic connections following Ten-m 
overexpression (c), as seen by axon-dendrite intermingling (arrowhead). 

e, f, Or88a axons and Mz19 dendrites connect at the VA1d glomerulus in 
control (e), but the connection is partially lost following Ten-a overexpression, 
as some of the Or88a axons no longer intermingle with Mz19 dendrites 


Matching expression of Ten-m and Ten-a 


Both Drosophila Teneurin proteins were endogenously expressed in 
the developing antennal lobe (Fig. 2a and Supplementary Fig. 3). At 
48 h after puparium formation (APF), when individual glomeruli just 
become identifiable, elevated Teneurin expression was evident in 
select glomeruli. The subset of glomeruli expressing elevated Ten-m 
was distinct but partially overlapping with that expressing elevated 
Ten-a (Fig. 2a, e). Teneurin proteins were also detected at a low level 
in all glomeruli. Both basal and elevated Teneurin expressions were 
eliminated by pan-neuronal RNA interference (RNAi) targeting the 
corresponding gene (Fig. 2b, c), suggesting that Teneurin proteins are 
produced predominantly by neurons. In a fen-a null mutant we 
generated (Supplementary Fig. 2a), all Ten-a expression was eliminated, 
confirming antibody specificity (Fig. 2d). 

The antennal lobe consists of ORN axons as well as PN and local 
interneuron dendrites. We used intersectional analysis to determine the 
cellular source for elevated Teneurin expression. For ten-m, we screened 
GAL4 enhancer traps near the ten-m gene, and identified NP6658 
(hereafter ten-m-GAL4, Supplementary Fig. 2b) that recapitulated the 
glomerulus-specific Ten-m staining pattern (Supplementary Fig. 4a—c). 
We used a FLPout reporter UAS>stop>mCD8GFP to determine the 
intersection of ten-m-GAL4 and an ORN-specific ey-Flp (Fig. 2f and 
Supplementary Fig. 4d-f) or a PN-specific GH146-Flp (Fig. 2g and 
Supplementary Fig. 4g-i). We found that ten-m-GAL4 was selectively 
expressed in a subset of ORNs and PNs. Owing to reagent availability, 
we focused our analysis on five glomeruli (DA1, VA1d, VA11m, DC3 
and DA3), adjacently located on the lateral and anterior side of the 
antennal lobe. In these five glomeruli, Ten-m expression in PN and 
ORN classes matched: high levels in PNs corresponded to high levels in 
ORNs and vice versa (Fig. 2f, g). 

To determine the cellular origin of elevated Ten-a expression, we 
performed tissue-specific RNAi of endogenous Ten-a, as no GAL4 
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(arrowhead). Target areas of Or47b (b, c) or Or88a (e, f) axons are outlined. 
Mismatching phenotypes are quantified in Supplementary Figs 9k and 10q. The 
first three columns in b, c, e, f show separate channels of the same section; the 
fourth shows higher magnification of the dashed squares (as in Figs 3, 4, 5d-g.). 
Unless indicated, all images in this and subsequent figures are single confocal 
sections and all scale bars are 10 um. g, Domain composition of Drosophila 
Ten-m and Ten-a. aa, amino acids; TM, transmembrane domain. h, Phylogeny 
of the Drosophila Teneurins and related proteins in other species. Branch 
lengths represent units of substitutions per site of the sequence alignment. 
Teneurins are evolutionarily conserved in bilaterians and a unicellular 
choanoflagellate Monosiga brevicollis, but not in cnidarians. 


enhancer trap is available near ten-a. To isolate Ten-a expression in 
ORNs, we drove pan-neuronal ten-a RNAi while specifically suppres- 
sing RNAi in ORNs using tubP>stop>GAL80 and ey-Fip (Fig. 2h). 
To restrict Ten-a expression to central neurons, we expressed ten-a 
RNAi in all ORNs (Fig. 2i). We found that Ten-a was highly expressed 
in a subset of ORNs and central neurons, and also showed a matching 
expression in the five glomeruli we focused on (Fig. 2h, i). The 
glomerular-specific differential Ten-a expression in central neurons 
probably arises mainly from PNs as they target dendrites to specific 
glomeruli, and punctate Ten-a staining was observed in PN cell bodies 
(Supplementary Fig. 5). In summary, Ten-m and Ten-a are each 
highly expressed in a distinct, but partially overlapping, subset of 
matching ORNs and PNs (Fig. 2)). 


Teneurins are required for PN-ORN matching 


To examine whether Teneurins are required for proper PN-ORN 
matching, we performed tissue-specific RNAi (Fig. 3 and Supplemen- 
tary Fig. 2c) in all neurons using C155-GAL4, in PNs using GH146- 
GAL4, or in ORNs using peb-GAL4. To label specific subsets of PN 
dendrites independent of GAL4-UAS, we used the Q binary expres- 
sion system”, and converted Mz19-GAL4 to Mz19-QF by bacterial 
artificial chromosome (BAC) recombineering (Supplementary Fig. 2d). 
We could thus perform GAL4-based RNAi knockdown while labelling 
PN dendrites and ORN axons in two colours independent of GAL4. 
We focused our analysis on Mz19 dendrites and Or47b axons, which 
innervate neighbouring glomeruli but never intermingle in wild type 
(Figs 1b and 3a, b). 

Pan-neuronal RNAi of both teneurin genes shifted Or47b axons to 
a position between two adjacent Mz19 glomeruli, DAl and VAId 
(Fig. 3c). Moreover, Mz19 dendrites and Or47b axons intermingled 
without a clear border (Fig. 3c, d), reflecting a PN-ORN matching 
defect. We confirmed this using independent RNAi lines targeting 
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Figure 2 | Ten-mand Ten-aare differentially expressed in matching PN and 
ORN classes. a, Developing antennal lobes at 48h APF stained by antibodies 
against Ten-m, Ten-a, and a neuropil marker, N-cadherin. Solid lines encircle 
the DA1 glomerulus (Ten-m low, Ten-a high). Dashed lines encircle the VA1d/ 
VAIm glomeruli (Ten-m high, Ten-a low). b, c, Ten-m and Ten-a proteins are 
undetectable following pan-neuronal RNAi of ten-m (b) and ten-a 

(c), respectively. d, A ten-a homozygous mutant eliminated the Ten-a antibody 
staining. e, Summary of elevated Ten-m and Ten-a expression in five select 


different regions of the ten-m and ten-a transcripts (Supplementary 
Fig. 6). Further, knocking down ten-m and ten-a only in PNs or only 
in ORNs also led to Mz19-Or47b intermingling (Fig. 3e and 
Supplementary Fig. 7a, d), indicating that Teneurins are required in 
both PNs and ORNs to ensure proper matching. 
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Figure 3 | Loss of Teneurins causes PN-ORN mismatching. a, Normally, 
Mz19 dendrites (green) innervate glomeruli adjacent to the VA11m glomerulus, 
which is itself innervated by Or47b axons (red). The dashed line encircles 
Or47b axons. DC3 PNs are located posterior to DA1/VA1d PNs and Or47b 
ORNs, and are not visible in these sections. c, Mismatching phenotypes in ten- 
mand ten-a RNAi driven by the pan-neuronal driver C155-GAL4. Dashed lines 
encircle Or47b ORN axons, showing intermingling with Mz19 PN dendrites 
(arrowhead). e, Quantification of Mz19-Or47b mismatching phenotypes. For 
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glomeruli. f, g, Expression of the Flp-out GFP reporter UAS>stop>mCD8GFP 
at the intersection of ten-m-GAL4 with ORN-specific ey-Flp (f) or with PN- 
specific GH146-Fip (g) in adult. h, i, Antibody staining of Ten-a in central 
nervous system (CNS)-neuron-specific RNAi (h) or in ORN-specific RNAi 
(i) at 48h APF. f-i, Right, individual cell-type-specific Teneurin expression 
patterns are schematically summarized. j, Combined expression patterns of 
Teneurin proteins in PNs (left) and ORNs (right). Blue, Ten-m high; orange, 
Ten-a high. Scale bars, 10 um. 


Next, we examined the contribution of each Teneurin by individual 
RNAi knockdown in ORNs. Knocking down ten-m and, to a lesser 
extent, ten-a, caused mild mismatching (Fig. 3e and Supplementary 
Fig. 7). This was greatly enhanced by simultaneous knockdown of 
both ten-m and ten-a (Fig. 3e), probably because Mz19-Or47b 
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all genotypes, n = 15. f, In control, DA1 PNs do not intermingle with Or47b 
ORNs. h, MARCM expression of ten-a RNAi in DA1 PNs causes dendrite 
intermingling with Or47b axons (arrowhead). j, Quantification of mismatching 
phenotypes. For all genotypes, n = 6. Error bars represent s.e.m. ***, P< 0.001 
compared to control. b, d, g, i, Summary showing normal connectivity in 
control (a, f) and mismatching phenotypes following teneurin RNAi 

(c, h). Blue, Ten-m high; orange, Ten-a high. Green outlines, labelled PNs. Red 
outlines, labelled ORNs. Scale bars, 10 Lm. 
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mismatching requires weakening connections with their respective 
endogenous partners (Supplementary Fig. 7g). This synergy implies 
that multiple matching molecules can enhance partner matching 
robustness. 

We also tested the functions of individual Teneurins in PNs. We 
found that the Mz19-Or47b mismatching was caused by PN-specific 
knockdown of ten-a, but not ten-m (Fig. 3e and Supplementary Fig. 7). 
As VA1d/DC3 and DA1 PNs arise from separate neuroblast lineages”’, 
we used mosaic analysis with a repressible cell marker (MARCM) to 
generate neuroblast clones to label and knockdown ten-a in DAI or 
VA1d/DC3 PNs (Fig. 3f-j; see Methods). ten-a knockdown only in 
DAI PNs (normally Ten-a high) caused their dendrites to mismatch 
with Or47b axons (Fig. 3h-j). By contrast, ten-a knockdown in VA1d/ 
DC3 PNs (normally Ten-a low) did not cause mismatching (Fig. 3j 
and Supplementary Fig. 8a, b). Similarly, MARCM loss-of-function of 
ten-a mutant in DA] but not in VA1d/DC3 PNs resulted in mismatch- 
ing with Or47b ORNs (Fig. 3j and Supplementary Fig. 8c, d). Thus, 
removal of ten-a from Ten-a-high DA1 PNs caused their dendrites to 
mismatch with Ten-a-low Or47b ORNs (Fig. 3i). The differential 
requirements of Ten-m and Ten-a in ORNs or PNs in preventing 
Mz19-Or47b mismatching probably reflect differential expression of 
Ten-m and Ten-a in the mismatching partners. 

Our finding that loss of ten-a caused Ten-a-high PNs to mismatch 
with Ten-a-low ORNs (Fig. 3i, j), together with the matching expres- 
sion of Teneurin proteins in PNs and ORNs, raised the possibility that 
Teneurins instruct class-specific PN-ORN connections through 
homophilic attraction: PNs expressing high-level Ten-m or Ten-a 
connect to ORNs with high-level Ten-m or Ten-a, respectively. 


Teneurins instruct matching specificity 


This homophilic attraction hypothesis predicts that overexpression of 
a given Teneurin in PNs (1) should preferentially affect PNs normally 
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Figure 4 | Teneurin overexpression in specific PN classes causes 
mismatching. a-l, Mismatching phenotypes following Ten-m (a-f) or Ten-a 
(g-l) overexpression in different PN classes. Specific PN classes are labelled by 
MARCM with Mz19-GAL4 and ORN axons using Or47b-rCD2 (a, d) or Or23a- 
mCD8GFP (g, j). In control, Mz19 PNs do not intermingle with Or47b ORNs 
(Fig. 1b). MARCM overexpression of Ten-m in DA1 PNs (a, arrowhead), but 
not VA1d/DC3 PNs (d), causes dendrite mismatching with Or47b axons. 
MARCM overexpression of Ten-a in VA1d/DC3 PNs (j, arrowhead), but not in 
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expressing low levels of that Teneurin, causing their dendrites to lose 
endogenous connections with their cognate ORNs, and (2) should 
cause these PNs to make ectopic connections with ORNs expressing 
high levels of that Teneurin. 

To test the first prediction, we examined whether Teneurin over- 
expression in Mz19 PNs impaired their endogenous connections with 
cognate ORNs. Consistent with our prediction, Ten-m overexpression 
specifically disrupted the connections of DA1 PNs and Or67d ORNs, a 
PN-ORN pair expressing low-level Ten-m (Supplementary Fig. 9b, e). 
Connections of the other two pairs were unaffected (Supplementary 
Fig. 9a, c, d, f). Likewise, Ten-a overexpression specifically disrupted 
connections between VA1d PNs and Or88a ORNs, a PN-ORN pair 
expressing low-level Ten-a (Supplementary Fig. 9g), but not between 
the other two PN-ORN pairs (Supplementary Fig. 9h, i). 

To test the second prediction, we examined the specificity of 
ectopic connections made by Mz19 PNs overexpressing Teneurins, 
and sampled five non-partner ORN classes that project axons to the 
vicinity of Mz19 dendrites (Supplementary Fig. 10). We found that 
Ten-m overexpression in Mz19 PNs caused their dendrites to mismatch 
only with Or47b ORNs (Supplementary Fig. 10f). To examine addi- 
tional mismatching phenotypes that may occur within Mz19 glomeruli 
and to determine whether DA1 or VA1d/DC3 PNs contribute to the 
ectopic connections, we used MARCM to overexpress Ten-m in indi- 
vidual PN classes. We found that Ten-m overexpression in DA1 PNs 
(Ten-m low) caused their dendrites to mismatch with Or47b (Fig. 4a, b) 
and (to a lesser extent) Or88a ORNs (Fig. 4b, c), both endogenously 
expressing high-level Ten-m. By contrast, Ten-m overexpression in 
VA1d/DC3 PNs did not produce ectopic connections with any non- 
matching ORNs tested (Fig. 4d-—f). 

Likewise, Ten-a overexpression in Mz19 PNs caused their dendrites 
to mismatch only with Or23a ORNs among all non-matching ORN 
classes sampled outside the Mz19 region (Supplementary Fig. 101). 
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DAI PNs (g), causes their dendrites to mismatch with Or23a axons. P{GS}9267 
and P{GE}1914 (Supplementary Fig. 2) are used to overexpress Ten-m and 
Ten-a, respectively. c, f, i, 1, Quantification of mismatching phenotypes (n = 9 
for each). Error bars represent s.e.m. See Supplementary Fig. 10 for details on 
some genotypes quantified here. b, e, h, k, Schematic summarizing the 
mismatching phenotypes in Fig. 4 and Supplementary Figs 9 and 10. Blue, Ten- 
m high; orange, Ten-a high. Scale bars, 10 um. 
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Further, MARCM overexpression of Ten-a in VA1d/DC3 PNs (Ten-a 
low) caused their dendrites to mismatch specifically with Or23a 
(Fig. 4j, k) and (to a lesser extent) Or67d ORNs (Fig. 4k, 1), both 
endogenously expressing high-level Ten-a (Fig. 41). By contrast, Ten-a 
overexpression in DA1 PNs (Ten-a high) did not produce ectopic 
connections with any non-matching ORNs tested (Fig. 4g-i). Thus, 
both Ten-m and Ten-a overexpression analyses support the homophilic 
attraction hypothesis. 

Our data also suggest that additional molecule(s) are required to 
determine completely the wiring specificity of the five PN-ORN pairs 
examined. For example, VAld—Or88a and VAl1m-Or47b have in- 
distinguishable Ten-m/Ten-a expression patterns (Fig. 2j), and may 
require additional molecules to distinguish target choice. Indeed, 
Ten-a knockdown (Fig. 3h—-j and Supplementary Fig. 8e, f) or Ten-m 
overexpression (Fig. 4b, c) caused DA1 PNs to mismatch preferentially 
with Or47b as opposed to Or88a axons. This suggests that the non- 
adjacent DA1 and VA11m share a more similar Teneurin-independent 
cell-surface code than the adjacent VA1d and VAI1Im. Likewise, Ten-a 
overexpression caused VA1d PNs to mismatch with the non-adjacent 
Or23a more so than the adjacent Or67d ORNs, even though both 
ORNs express high-level Ten-a (Fig. 4k, 1). Finally, Ten-m overexpres- 
sion in DC3 PNs, which express low-level Ten-m, did not change its 
matching specificity (Fig. 4f and Supplementary Fig. 9f), suggesting 
that Teneurin-independent mechanisms are involved in matching 
DC3 PNs and Or83c ORNs. 

In summary, we showed that Teneurin overexpression in Teneurin- 
low PNs caused their dendrites to lose endogenous connections 
with Teneurin-low ORNs and mismatch with Teneurin-high ORNs 
(Fig. 4b, k). However, Teneurin overexpression in Teneurin-high PNs 
did not disrupt their proper connections (Fig. 4e, h). These data indi- 
cate that Teneurins instruct connection specificity, probably through 
homophilic attraction, by matching Ten-m or Ten-a levels in PN and 
ORN partners. 


Ten-m promotes PN-ORN homophilic attractions 


To test whether Teneurins interact in vitro, we separately transfected 
two populations of Drosophila S2 cells with Flag- and haemagglutinin 
(HA)-tagged Teneurins, and performed co-immunoprecipitations 
from lysates of these cells after mixing. We detected strong homophilic 
interactions between Flag- and HA-tagged Ten-m proteins and, to a 
lesser extent, between Flag- and HA-tagged Ten-a proteins (Fig. 5a). 
Ten-m and Ten-a also exhibited heterophilic interactions (Fig. 5a), 
which may account for their role in synapse organization”®. 

Next, we tested whether Teneurins can homophilically promote 
in vivo trans-cellular interactions between PN dendrites and ORN 
axons. We simultaneously overexpressed Ten-m in Mz19 PNs using 
Mz19-QF, and Or67a and Or49a ORNs using AM29-GAL4 (ref. 32; 
Fig. 5b). This enabled us to label and manipulate independently Mz19 
dendrites and AM29 axons with distinct markers and transgenes. We 
chose AM29-GAL4 because of its early onset of expression, whereas 
other class-specific ORN drivers start to express only after PN-ORN 
connection is established**®. AM29 axons do not normally connect 
with Mz19 dendrites (Fig. 5c, d). 

Simultaneous overexpression of Ten-m in both Mzl9 PNs and 
AM29 ORNs produced ectopic connections between them (Fig. 5c, g), 
suggesting that Ten-m homophilically promotes PN-ORN attraction. 
By contrast, Ten-m overexpression only in PNs or ORNs did not 
produce any ectopic connections, despite causing dendrite or axon 
mistargeting, respectively (Fig. 5c, e, f). These data ruled out the 
involvement of heterophilic partners in Ten-m-mediated attraction. 
Simultaneous overexpression of Ten-a in Mz19 PNs and AM29 ORNs 
did not produce ectopic connections (data not shown), possibly due to 
lower expression or weaker Ten-a homophilic interactions (Fig. 5a). 
Although heterophilic interactions between Ten-m and Ten-a also 
occur in vitro (Fig. 5a), heterophilic overexpression of Ten-m and 
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Figure 5 | Ten-m promotes homophilic interactions in vitro and in vivo. 
a, Co-immunoprecipitation of Flag- and HA-tagged Teneurin proteins from 
separately transfected $2 cells. Co-immunoprecipitated HA-tagged Teneurin 
proteins are detected by anti-HA antibody and immunoprecipitated Flag- 
tagged Teneurin proteins by anti-Flag antibody (top two blots). The input 
lysates are immunoblotted for HA and Flag (bottom two blots). b, Schematic 
showing the relative positions of glomeruli targeted by Mz19 PN dendrites 
(green) and AM29 ORN axons (red). ¢, Quantification of mismatching between 
Mz19 PNs and AM29 ORNs (n = 10 for each condition). OE, overexpression. 
d, In control, Mz19 dendrites do not connect with AM29 axons. 

e, f, Overexpression of Ten-m only in AM29 ORNs (e) or Mz19 PNs (f) does 
not produce mismatching between them. Following Ten-m overexpression, 
AM29 axons mistarget posteriorly to Mz19 dendrites and are therefore not 
visible in e. g, Simultaneous overexpression of Ten-m in both PNs and ORNs 
produces ectopic Mz19-AM29 connections (arrowhead). Schematics on the 
right show the Mz19-AM29 connectivity in different conditions. h, The 
synaptic vesicle marker Synaptotagmin is enriched at these Mz19-AM29 
ectopic connections. AM29 axons are labelled by AM29-Gal4 with UAS-mtdT 
to visualize the entire axonal processes and UAS-synaptotagmin-HA to 
visualize synaptic vesicles in axon terminals. Mz19 dendrites are labelled by 
Mz19-QF driving QUAS-mCD8GEP. To overexpress Ten-m, P{GS}9267 and 
QUAS-ten-m (Supplementary Fig. 2) are driven by AM29-GAL4 and Mz19-QF, 
respectively. Scale bars, 10 jum. 


Ten-a in AM29 ORNs and Mz19 PNs did not produce ectopic con- 
nections (data not shown). 

Finally, we examined whether these ectopic connections lead to 
the formation of synaptic structures. Indeed, the ectopic connec- 
tions between Mz19 dendrites and AM29 axons were enriched in 
synaptotagmin-HA expressed from AM29 ORNs (Fig. 5h), suggest- 
ing that these connections can aggregate synaptic vesicles and could 
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be functional. We propose that Teneurins promote attraction between 
PN-ORN synaptic partners through homophilic interactions, even- 
tually leading to synaptic connections. 


Discussion 

Compared to axon guidance, relatively little is known about synaptic 
target selection mechanisms**. Among the notable examples, the 
graded expressions of vertebrate EphA and ephrin-A instruct the 
topographic targeting of retinal ganglion cell axons****. Chick 
DSCAM and Sidekick promote lamina-specific arborization of retinal 
neurons**. Drosophila Capricious promotes target specificity of 
photoreceptor and motor axons'**”-*’. Caenorhabditis elegans SYG-1 
and SYG-2 specify synapse location through interaction between pre- 
synaptic axons and intermediate guidepost cells*°. However, it is 
unclear whether any of these molecules mediate direct, selective 
interactions between individual pre- and postsynaptic partners. 
Indeed, in complex neural circuits, it is not clear a priori whether 
molecular determinants mediate such interactions. For example, the 
final retinotopic map is thought to result from both ephrin signalling 
and spontaneous activity“. Mammalian ORN axon targeting 
involves extensive axon—axon interactions through activity-dependent 
and independent modes**’, with minimal participation of post- 
synaptic neurons identified thus far. 

Here, we show that Teneurins instruct PN-ORN matching through 
homophilic attraction. Although each glomerulus contains many 
synapses between cognate ORNs and PNs, these synapses transmit 
the same information and can be considered identical with regard to 
specificity. Thus, Teneurins represent a strong case in determining 
connection specificity directly between pre- and postsynaptic neurons. 
We further demonstrate that molecular determinants can instruct 
connection specificity of a moderately complex circuit at the level of 
individual synapses. 

Our study reveals a requirement for PN-ORN attraction in the 
stepwise assembly of the olfactory circuit. PN dendrites and ORN 
axons first independently project to appropriate regions using global 
cues, dendrite-dendrite and axon-axon interactions®*”’"“. The initial, 
independent targeting of PN dendrites and ORN axons is eventually 
coordinated in their final one-to-one matching. We identified 
Teneurins as the first molecules to mediate this matching process, 
through direct PN-ORN attraction. Our analyses have focused on a 
subset of PN-ORN pairs involving trichoid ORNs”, including Or674, 
Or88a and Or47b that have been implicated in pheromone sensation*®. 
The partially overlapping expressions of Teneurins in other PN and 
ORN classes (Fig. 2 and Supplementary Fig. 4) suggest a broader 
involvement of Teneurins. At the same time, additional cell-surface 
molecules are also needed to determine completely connection specificity 
of all 50 PN-ORN pairs. 

Teneurins are present throughout Animalia (Fig. 1h). Different 
vertebrate teneurins are broadly expressed in distinct and partially 
overlapping patterns in the nervous system’’. Teneurin-3 is expressed 
in the visual system and is required for ipsilateral retinogeniculate 
projections’. Our study suggests that differential Teneurin expression 
may have a general role in matching pre- and postsynaptic partners. 
Indeed, high-level Ten-m is involved in matching select motor neurons 
and muscles*®. Furthermore, Ten-m and Ten-a also trans-synaptically 
mediate neuromuscular synapse organization”’. This suggests that the 
synapse partner matching function of Teneurins may have evolved 
from their basal role in synapse organization. Interestingly, synaptic 
partner matching only involves homophilic interactions (this study 
and ref. 26), whereas synapse organization preferentially involves 
heterophilic interactions”®. This could not be fully accounted for by 
the different strength of their homophilic and heterophilic interactions 
in vitro (Fig. 5a). We speculate that these dual functions of Teneurins 
in vivo may engage signalling mechanisms that further distinguish 
homophilic versus heterophilic interactions. 
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METHODS SUMMARY 


Detailed methods on fly stocks, generation of the ten-a allele, construction of 
transgenic flies, clonal analysis, histology, imaging, quantification and statistical 
analysis, epitope-tagged constructs, and co-immunoprecipitation can be found in 
Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Fly stocks. Mz19-GAL4 (ref. 7) was used to label PNs. Or-rCD2 lines’*** (Or47b- 
rCD2 and Or88a-rCD2), Or-mCD8GFP lines*® (Or23a-mCD8GFP, Or43a- 
mCD8GFP, Or46Aa-mCD8GFP, Or56a-mCD8GFP and Or83c-mCD8GFP), 
Or67d-GAL4 (ref. 49) and AM29-GAL4 (ref. 32) were used to label ORNs. 
GH146-Flp (ref. 9), ey-Flp (ref. 50), UAS>stop>mCD8GFP (ref. 9) and 
tubP>stop>GAL80 (ref. 51) were used to perform the intersectional expression 
analysis. P{GS}9267 (ten-m) was generated by the Drosophila Gene Search Project 
(Metropolitan University)” and P{GE}1914 (ten-a)'® was from the GenExel 
collection of EP lines generated by the Korean Advanced Institute of Science 
and Technology. Their ability to drive the overexpression of each respective 
Teneurin was verified by elevated antibody staining. 

All RNAi lines targeting ten-m or ten-a from the Vienna Drosophila RNAi 
Center (UAS-ten-m®%4*V°"73 and UAS-ten-a®NA‘ V4?) the Bloomington 
Drosophila Stock Center (UAS-ten-mRNA1I¥03323 and UAS-ten-aNAtI#03975), 
and the National Institute of Genetics Fly Stock Center (UAS-ten-m®\4'°778 
and UAS-ten-a®\4' 798) were collected. The efficiency of all RNAi lines was 
tested by pan-neuronal expression using C155-GAL4 followed by Ten-m or 
Ten-a antibody staining. UAS-ten-m®™4°Y°" and UAS-ten-m®NAFO%3 
targeting ten-m, and UAS-ten-a®N4"V9748? and UAS-ten-a®NA11F0997 targeting 
ten-a were able to eliminate respective antibody staining beyond detection. 
UAS-ten-m®N4*-V91173 and UAS-ten-a®%4"Y*4? were used in all the experiments 
except Supplementary Fig. 6, in which UAS-ten-m®N41¥°25. and UAS-ten- 
a®NA‘IFO3579 were used to confirm the RNAi phenotypes. UAS-Der2 (ref. 53) 
was used to enhance RNAi efficiency. 

To identify ten-m-GAL4, we collected a group of GAL4 enhancer traps” 
located near the 5’ end of the ten-m gene. Their expression patterns were deter- 
mined using a membrane-tagged GFP reporter UAS-mCD8GFP gene from the 
Drosophila Genetic Resource Center. NP6658-GAL4, which recapitulated the 
glomerulus-specific Ten-m staining pattern, was identified and referred to as 
ten-m-GAL4. 

Generation of the ten-a allele. A small deficiency of ten-a was generated by FRT- 
mediated excision. This deficiency allele is homozygous viable and contains a 
deletion between P{XP}d07540 and PBac{WH}f01428, which flanks the entire 
ten-a genomic region (~140kb) and four additional predicted genes (Sup- 
plementary Fig. 2a), and is referred to as Df(X)ten-a. The deletion was verified 
by both PCR and antibody staining against Ten-a. The loss-of-function pheno- 
types were due to the loss of ten-a rather than the four additional predicted genes, 
as they mimicked the RNAi phenotypes. 

Construction of UAS and QUAS transgenic flies. Ten-m and Ten-a coding 
sequences were amplified from the cDNA constructs******. One primer amplified 
from the start codon and added a CACC overhang for the TOPO reaction and a 
Kozak sequence. The other primer amplified to the stop codon. The PCR pro- 
ducts were subcloned into pENTR-D/TOPO (Invitrogen). A 46-bp irrelevant 
fragment was found in the middle of the ten-m coding sequence in the original 
cDNA construct, and was removed by replacing a small region containing this 
fragment with the corresponding region in the ten-m genomic DNA. To make 
UAS-ten-m, UAS-ten-a, QUAS-ten-m and QUAS-ten-a, pENTR-ten-m and 
pENTR-ten-a were recombined into destination vector pUASt-Gateway-attB”’ 
and pQUASt-Gateway-attB using LR Clonase II (Invitrogen). The destination 
vector pQUASt-Gateway-attB was constructed by replacing the UAS site in 
pUASt-Gateway-attB with a QUAS site. All constructs were sequence verified. 
All the UAS and QUAS transgenes were integrated into both attP24 and 86Fb 
landing sites***° on second and third chromosomes, respectively. All transgenic 
flies were verified by PCR and overexpression followed by antibody staining. The 
UAS and QUAS transgenes inserted in the 86Fb site were used in this paper. 
BAC recombineering to construct Mz19-QF. A 110-kb BAC (#CH321-85L03) 
in the attB-P[acman]-CmR vector®, which contains genomic DNA that covers the 
Mz19-GAL4 enhancer trap insertion site, was collected from the BACPAC 
Resources Center. The QF coding sequence, with a P-element minimal promoter 
and an hsp70 polyA, was amplified using primers containing 50-bp arm sequences 
allowing site-specific recombination. The 5-kb PCR product was recombined into 
the 110-kb genomic BAC using bacterial BAC recombineering and was verified by 
sequencing. The 115-kb Mz19-QF BAC was further verified by digestion pattern 
analysis and used to produce transgenes at the VK37 landing site*' on the second 
chromosome by BestGene. The Mz19-QF transgenic flies were verified by PCRand 
the expression of reporters QUAS-mCD8GFP or QUAS-mTdt3HA. 

Clonal analysis. To determine the contribution of individual PNs to the ectopic 
connections with ORNs, the MARCM method® was applied. Briefly, heat-shock- 
induced Flp activity caused mitotic recombination of the FRT chromosome arm such 
that one of the daughter cells lost GAL80. This cell (and its progeny) can therefore be 
labelled by the GAL4-UAS system. For generating neuroblast clones, flies were heat- 
shocked between 24-48 h after egg laying for 1h at 37 °C. Mz19-GAL4 labels VA1d 


and DC3 from the anterodorsal neuroblast and DA1 from the lateral neuroblast’. By 
generating neuroblast clones at 24-48h after egg laying, we used MZ19-GAL4 to 
specifically label DA1 or VA1d/DC3 PNs and simultaneously express RNAi targeting 
ten-a, or overexpress Ten-m or Ten-a in the labelled neurons. 

In the ten-a mutant analysis, Df(X)ten-a was placed in trans to GAL80 on the 
FRT chromosome arm. Upon Flp-induced mitotic recombination, one of the 
daughter cells became homozygous for ten-a and simultaneously lost GAL80. 
We used MZ19-GAL4 to specifically label DA1 or VA1d/DC3 mutant PNs. 

Different classes of ORNs, except for Or67d, were labelled by Or-mCD8GFP 
transgenes in a GAL4-independent manner, allowing the visualization of the 
specific matching between the labelled PNs and ORNs. Owing to the lack of an 
Or67d-mCD8GFEP, Or67d ORNs were labelled by Or67d-GAL4 and Ten-m over- 
expression was achieved by using Mz19-QF to drive QUAS-ten-m (Supplemen- 
tary Fig. 9). In Teneurin overexpression by Mz19-QF, Or67d-GAL4 expression 
was found unchanged compared with the control, and co-localized with Ncad 
staining in the DA1 glomerulus, which can be unambiguously identified 
(Supplementary Fig. 9). Therefore, Ncad staining in the DA1 glomerulus was 
used to determine the location of Or67d ORNs in Fig. 4f, 1, in which Teneurins 
were overexpressed by Mz19-GAL4. 

Histology. The procedures used for fixation and immunostaining were described 
previously’. For primary antibodies, we used mouse nc82 (1:30), rat antibody to 
N-cadherin (1:40), rat antibody to mCD8 (1:100), mouse antibody to rCD2 
(1:200), chicken antibody to GFP (1:1,000), mouse antibody to HA (1:1,000), 
rabbit antibody to HA (1:1,000), rabbit antibody to DsRed (1:500), mouse antibody 
(mAb20) to Ten-m (1:3,000)*, and guinea pig antibody to Ten-a (1:100)™. 
Neuropil staining indicates the antennal lobe, where PN dendrites and ORN axons 
are located. Fluorescent labelling outside the antennal lobe may come from labelled 
PN cell bodies or non-specific tissues. 

Imaging, quantification and statistical analysis. Immunostained brains were 
imaged with a Zeiss LSM 510 Meta laser-scanning confocal microscope. Images 
of antennal lobes were taken as confocal stacks with 1-j1m-thick sections. 
Representative single sections were shown to illustrate the matching and mismatch- 
ing between PN dendrites and ORN axons. Penetrance of phenotypes represents the 
percentage of animals in which at least one antennal lobe showed a given phenotype 
among the total animals examined. Percentage of intermingling represents the 
fraction of labelled dendrites located within the axonal area of a given ORN class, 
and was measured by dividing dendritic area by total axonal area in a single confocal 
plane that shows maximum intermingling between dendrites and axons. Statistical 
significance between two samples was determined by the unpaired Student’s t-test. 
Flag- and HA-tagged constructs. To express Flag- and HA-tagged proteins in $2 
cells, the Gateway destination vectors pUASt-Flag-Gateway(-w) and pUASt-HA- 
Gateway(-w) were generated by removing a ~4.5-kb non-essential fragment 
between two DrallI sites that contains the white gene from the original 
Gateway vectors pTFW and pTHW (Drosophila gateway collection, DGRC, 
Bloomington), respectively. The modified destination vectors are ~40% smaller 
than the original ones while preserving all the essential components for S2 cell 
expression, and showed greater transfection and expression efficiency in S2 cells. 
To express Flag- and HA-tagged Teneurin proteins in S2 cells, pENTR-ten-m and 
pENTR-ten-a were recombined into modified destination vectors pUASt-Flag- 
Gateway(-w) and pUASt-HA-Gateway(-w) using LR Clonase II (Invitrogen). All 
expression constructs, including UAS-Flag-ten-m, UAS-Flag-ten-a, UAS-HA- 
ten-m and UAS-HA-ten-a, were sequence verified. 

Co-immunoprecipitation assay. S2 cells were cultured in Schneider’s insect 
medium (Sigma) according to the manufacturer’s description. UAS-Flag-ten-m, 
UAS-Flag-ten-a, UAS-HA-ten-m or UAS-HA-ten-a constructs were separately trans- 
fected into S2 cells, along with an Actin5c-GAL4 vector, using the Effectene transfec- 
tion reagent (QIAGEN). The amount of each construct and the number of cells used 
for transfection were adjusted to ensure comparable expression levels of Ten-m and 
Ten-a proteins. Three days after transfection, separately transfected cells were har- 
vested, mixed together, and incubated for 1 h at room temperature (25 °C). Equivalent 
amounts of untransfected cells were used as controls, and the final mixtures contained 
the same total amount of cells under all co-immunoprecipitation conditions. The 
mixed cells were lysed in lysis buffer (50mM Tris-HCl pH 7.4, 10mM MgCh, 
150 mM NaCl, 1 mM EGTA, 10% glycerol) supplemented with 0.5% Nonidet P-40 
and protease inhibitor cocktail (Sigma). The cell lysates were then incubated with 
EZview Red anti-Flag M2 affinity gel (Sigma) for 3 hat 4 °C with rotation. The samples 
were washed extensively in lysis buffer. The proteins were eluted in 2% SDS elusion 
buffer, and were detected using western blot analysis using rat antibody to HA 
(1:1,000, Roche), mouse antibody to Flag (1:5,000, Sigma), and HRP-conjugated-goat 
antibodies to rat or mouse primaries (both at 1:20,000, Jackson ImmunoResearch) 
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Transcription factor PIF4 controls the 
thermosensory activation of flowering 


S. Vinod Kumar'*, Doris Lucyshyn'+*, Katja E. Jaeger’, Enriqueta Alds', Elizabeth Alvey’, Nicholas P. Harberd'? & Philip A. Wigge't 


Plant growth and development are strongly affected by small dif- 
ferences in temperature’. Current climate change has already 
altered global plant phenology and distribution”’, and projected 
increases in temperature pose a significant challenge to agricul- 
ture*. Despite the important role of temperature on plant develop- 
ment, the underlying pathways are unknown. It has previously 
been shown that thermal acceleration of flowering is dependent 
on the florigen, FLOWERING LOCUS T (FT)°**. How this occurs 
is, however, not understood, because the major pathway known to 
upregulate FT, the photoperiod pathway, is not required for thermal 
acceleration of flowering®. Here we demonstrate a direct mechanism 
by which increasing temperature causes the bHLH transcription 
factor PHYTOCHROME INTERACTING FACTORS (PIF4) to 
activate FT. Our findings provide a new understanding of how plants 
control their timing of reproduction in response to temperature. 
Flowering time is an important trait in crops as well as affecting 
the life cycles of pollinator species. A molecular understanding of 
how temperature affects flowering will be important for mitigating 
the effects of climate change. 

Arabidopsis thaliana, like many higher plants, responds to warmer 
ambient temperatures by increasing its growth rate and accelerating 
the floral transition’*”. Arabidopsis is a facultative long-day plant, and 
plants grown under short photoperiods are dramatically delayed in 
flowering. Interestingly, late flowering in short days can be overcome 
by growth at higher temperatures®. The underlying mechanism is, 
however, unknown. The flowering response to temperature is dependent 
on the floral pathway integrator gene FT’, indicative of a thermosensory 
pathway that upregulates FT expression independently of daylength. 
Because the bHLH transcription factor PIF4 has been shown to regulate 
architectural responses to high temperature*”, we tested if PIF4 is 
required for the induction of flowering at high temperature in short 
photoperiods. Although pif4-101 was slightly delayed in flowering at 
22°C, pif4-101 mutants showed a striking loss of thermal induction of 
flowering at 27°C (Fig. la, b). To test if pif4-101 perturbed floral 
induction by affecting FT expression, we examined the thermal induc- 
tion of FT in Col-0 and pif4-101. Although FT expression was strongly 
thermally inducible in Col-0, this response was largely abolished in 
pif4-101 at 27°C (Fig. 1c), indicating that PIF4 is necessary for the 
thermal acceleration of flowering in short days. By contrast, PIF4 is 
not required for the thermosensory induction of flowering under 
continuous light*, suggesting that the photoperiod pathway also 
interacts with the ambient temperature sensing pathway. The reduced 
role of PIF4 under continuous light probably reflects the instability of 
PIF4 in light’® coupled with the fact that the output of the photoperiod 
pathway, CONSTANS (CO) protein, is stabilized by light", shifting 
the balance of floral induction from PIF4 to the photoperiod 
pathway. Because PIF4 is necessary for the thermal induction of 
flowering in short days, we tested if it is sufficient to trigger flowering 
when overexpressed. 35S::PIF4 caused extremely early flowering 


(Fig. 1d, e), similar to the effect of overexpressing a related gene, 
PHYTOCHROME INTERACTING FACTORS (ref. 12), suggesting 
that PIF4 is limiting for the acceleration of flowering at lower temper- 
ature in short photoperiods. Consistently, 35S::PIF4 plants showed 
elevated levels of FT (Fig. 1f). Furthermore, 35S::PIF4 ft-3 showed a 
complete suppression of the early flowering phenotype, indicating that 
the induction of flowering by 35S::PIF4 was dependent on FT (Fig. 1g, h). 
This activation of FT appears to be independent of the established 
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Figure 1 | PIF4 is necessary for the thermal induction of flowering in short 
photoperiods. a, pif4-101 plants do not show acceleration of flowering at 27 °C 
compared with Col-0. Inset, a 16-week-old pif4-101 plant grown at 27 °C. 

b, Rosette leaf numbers at flowering for Col-0 and pif4-101 grown at 22 and 
27 °C in short-photoperiod conditions (error bars, SD; n = 6). c, FT expression 
as measured by quantitative polymerase chain reaction (qPCR) PCR in 4-week- 
old plants at 22 and 27 °C under short photoperiods in a PIF4-dependent 
manner (data from three biological replicates; error bars, SD). d, 35S::PIF4 
overexpression triggers very early flowering. e, Leaf numbers at flowering for 
Col-0 and 35S::PIF4 in long photoperiods (error bars, SD; n = 5). f, CO and FT 
gene expression data measured by qPCR in Col-0 and 35S::PIF4 at 21 °C in long 
photoperiods (samples taken 2 weeks after sowing; data from three biological 
replicates; all error bars, SD). g, FT is required for the early flowering phenotype 
of 35S::PIF4 plants. When crossed into the ft-3 background, the early flowering 
of 35S::PIF4 is completely suppressed. Inset, top view of the 35S::PIF4 ft-3 plant 
showing that petiole elongation growth is retained. h, Flowering time data for 
Col-0, Ler, 35S::PIF4, ft-3 and 35S::PIF4 ft-3 plants (error bars, SD; n = 5). 
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photoperiod pathway because CO did not change in response to 
35S::PIF4 (Fig. 1f). Finally, although co-9 mutants are late flowering®”’, 
we found 35S::PIF4 co-9 plants were early flowering, indicating that 
PIF4 acts largely independently of CO (Supplementary Information 
and Supplementary Fig. 1), consistent with the thermal induction of 
flowering being independent of the photoperiod pathway (Fig. 1f)°. 
Although PIF4 has been shown to be important for high-temperature 
responses, long-term increases in either PIF4 transcript or PIF4 protein 
levels in response to higher ambient temperature that can account for 
the observed growth responses have not been detected*”. To examine if 
variation of PIF4 transcription under our experimental conditions 
might account for the increases in PIF4 activity with temperature, 
we measured PIF4 transcript levels at 12, 17, 22 and 27 °C in seedlings 
(Fig. 2a). PIF4 transcript levels increased from 12 °C to 22 °C, whereas 
the difference between 22 °C and 27 °C was not statistically significant. 
Plants at 27°C, compared with 22°C, showed a very large PIF4- 
dependent response, suggesting that variation in the PIF4 transcript 
is not sufficient to account for the acceleration of flowering at 27 °C 
compared with 22 °C. To test whether temperature-mediated changes 
in PIF4 transcription are rate limiting for the biological response, we 
analysed the behaviour of plants constitutively expressing PIF4. 
Although 35S::PIF4 plants at 22 °C were extremely early flowering, this 
phenotype could be largely suppressed at 12°C (Fig. 2b and Sup- 
plementary Fig. 2), indicating that even when PIF4 transcript is abund- 
ant, lower temperatures are inhibitory for PIF4 activity. A possible 
explanation for this difference is that PIF4 protein is destabilized by 
low temperature. Indeed, PIF4 protein levels have already been shown 
to be strongly regulated by light’®, and growth in red and blue photo- 
cycles destabilizes PIF4 protein at low temperatures’. The PIF4 over- 
expression lines contain a fusion to the haemagglutinin (HA) epitope 
(35S::PIF4:HA). We therefore examined the levels of PIF4:HA, protein 
at 12, 17, 22 and 27 °C under the same light conditions used for our 
flowering time assays. Consistent with previous studies'® we saw a 
strong accumulation of PIF4 at the end of the night period, which 
was subsequently degraded during the day. Despite the suppression 
of early flowering in 35S::PIF4 at 12 °C compared with 22 °C (Fig. 2b), 
we did not observe an appreciable difference in PIF4 protein levels at 
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Figure 2 | Regulation of PIF4 by temperature. a, Transcriptional regulation 
of PIF4 by temperature. Ten-day-old Col-0 seedlings grown at 12, 17, 22 and 
27 °C under short photoperiods were analysed for PIF4 expression by qPCR. 
Data shown are from three biological replicates. Error bars, s.d. b, Flowering 
phenotype of 35S::PIF4 is temperature dependent. Although 35S::PIF4 plants 
(right) flower very early at 22 °C (upper panel) compared with Col-0 (left), this 
phenotype is largely suppressed by growth at 12 °C (lower panel). c, PIF4 
protein levels in 35S::PIF4 plants are not affected by growth temperature. 
Seven-day-old 35S::PIF4:HA seedlings grown at 17 °C were transferred to 12, 
17, 22 and 27 °C under short photoperiods for 2 days and samples were 
collected at the end of night before light and after 4h under illumination. 
Although PIF4:HA protein levels are independent of growth temperatures, the 
protein is robustly degraded in presence of light. PIF4:HA protein was detected 
by horseradish peroxidase (HRP)-conjugated HA antibody. Stained lower half 
of the gel used for immunoblot is shown as loading control. 
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these two temperatures that was likely to account for these different 
phenotypes (Fig. 2c and Supplementary Fig. 3). Slightly higher levels 
of PIF4:HA appeared to be present at 27°C (Fig. 2c), suggesting 
high-temperature stabilization of PIF4 may also contribute to higher 
PIF4 activity at 27 °C. 

Taken together, these data indicate that PIF4 regulates FT in a 
temperature-dependent manner. To determine if this is probably the 
case in planta, we analysed the spatial expression of FT and PIF4. FT 
has a distinctive pattern of expression in the vasculature of the leaf*>"®, 
and significantly PIF4 was expressed in the same domain (Fig. 3a). 
Because the regulation of FT by PIF4 could be either direct or indirect, 
we used chromatin immunopurification (ChIP) to analyse if PIF4 
binds directly to the FT promoter proximal to the transcriptional start 
site. This region of the promoter was chosen because it has been shown 
to be both phylogenetically conserved and the site for light-mediated 
regulation of FT expression'®’’. We observed robust enrichment of 
PIF4 near to the transcriptional start site (Fig. 3b), indicating that PIF4 
binds this region in vivo to activate FT expression. 


Mock ChIP 1 a ae 
— ee 


Input 


FT::GUS PIF4::GUS 
c 
Cl. C15 f 
e 
0.007 
0.006 
012°C BPE 
0.005 | 827°C 0.006 + oFT-f 
+e = = 0.005 
5 2 0.004 a3 
Es Ec 
ae 5 = 0.004 
= 2 0.003 = 
56 SS 0.003 
+5 xo 
re vee a & 0.002 
0.001 0.001 
0 1 1 ae, Q Le : 
FT-cl Fic FI-15 FT-f 17°C 22°C 27°C 
f g 
0.005 0.02 
aFT-c1 aFT-c 
aS 
oe aFT-c 
= 2B 0.004 +e 
ge 5a 
3% Es 
8 2 0.003 58 
Ns es 
& 8 0.002 +8 
mie 7 S 
0.001 a 
0 
17° 17°C 27°C (27°C Col-0 arp6-1 
Mock H2A.Z Mock H2AZ 


Figure 3 | PIF4 directly binds the FT promoter in a temperature-dependent 
manner. a, B-Glucuronidase (GUS) histochemical analysis of the expression 
domains of FT and PIF4 in rosette leaves. b, ChIP analysis shows PIF4 binding 
to the FT locus in vivo in seedlings. The At5g45280 promoter is a positive control 
for PIF4-binding activity”', the HSP70 promoter was used as a negative control. 
c, Summary of the structure of the FT promoter and positioning of qPCR 
amplicons for ChIP analysis. d, ChIP analysis of 35S::PIF4:HA at 12 and 27 °C 
(2-week-old seedlings, short photoperiods). e, ChIP analysis of PIF4::PIF4:ProA 
at 17, 22 and 27 °C (4-week-old soil-grown plants, short photoperiods). 

f, Analysis of H2A.Z occupancy at the FT locus at 17 and 27 °C (3-week-old 
plate-grown plants, short photoperiods). g, ChIP analysis of PIF4 binding to FT 
in Col-0 and arp6-1 (3-week-old soil-grown plants, 22 °C short photoperiod). 
(For all ChIP experiments, plant materials were collected at the end of dark 
period before lights came on and were protected from light until frozen. All data 
presented are from two independent ChIP experiments; all error bars, SD.) 
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Given the striking effect of ambient temperature on PIF4 activity, 
which occurs even when PIF4 is constitutively expressed, we hypothesized 
that the ability of PIF4 to bind the FT promoter may be temperature 
dependent. To test this, we performed ChIP experiments using 
35S::PIF4 plants grown at 12 and 27°C with primers flanking an 
E-box in the FT promoter (Fig. 3c). Strikingly, we observed a very strong 
temperature dependence for this binding, with an approximately 
fivefold increase in binding at 27 °C compared with 12°C (Fig. 3d). 
This indicates that the later flowering of 35S::PIF4 at 12 °C is caused bya 
decrease in PIF4 binding to FT. Because the 35S promoter causes strong 
ectopic expression of PIF4, we sought to confirm that PIF4 protein 
expressed at endogenous levels displays similar temperature-dependent 
binding to the FT promoter. We therefore performed ChIP experiments 
on a pif4-101 line complemented with PIF4,,,::PIF4:ProteinA (Sup- 
plementary Fig. 4). Consistent with the overexpression studies, we 
observed a strong increase in PIF4 binding to FT as a function of 
temperature. Reduced binding was observed at 17 °C, consistent with 
the very late flowering of plants under short days at low temperature, 
but this binding increased at 22°C and was even higher at 27°C 
(Fig. 3e). The temperature-dependent binding of PIF4 to FT could be 
due to growth temperature influencing the affinity of the PIF4 tran- 
scription factor for its binding site, or the efficiency of the ChIP could be 
affected by the temperature at which tissues were grown. To test these 
possibilities, we analysed another recently described PIF4 target 
locus'*, CYP79B2 (At4g39950), which is upregulated in 35S::PIF4 (Sup- 
plementary Fig. 5a). We found PIF4 binding to occur constitutively at 
both 12 and 27 °C at a region in the first exon (Supplementary Fig. 5b). 
Another region further upstream in the promoter showed a temper- 
ature-dependent binding of PIF4, and, in both cases, no enrichment was 
seen for a control locus (Supplementary Fig. 5b). This indicates that the 
abundant PIF4 protein we observed at 12 °C is active and able to bind 
target sites, and confirms that the ChIP method per se is not influenced 
by the temperature at which the sample is grown, consistent with other 
studies’. The ability of PIF4 to bind loci in a more temperature- 
independent manner might explain why 35S::PIF4 at 12 °C maintains 
hypocotyl and petiole elongation, while early flowering is strongly 
suppressed. We do not exclude that temperature may also influence 
PIF4 activity post-translationally. 

Temperature signals are mediated through H2A.Z-nucleosomes in 
Arabidopsis”, suggesting that temperature may be increasing the 
accessibility of the PIF4-binding site at the FT promoter. Consistent 
with this hypothesis, we found that H2A.Z-nucleosomes were present 
at the PIF4-binding site in the FT promoter. Furthermore, we found 
that the levels of H2A.Z-nucleosomes at the FT promoter decreased 
with higher temperature (Fig. 3f). These results suggest that the pres- 
ence of H2A.Z-nucleosomes is limiting for PIF4 binding to FT, and 
that the PIF4 binding we observed at higher temperature is due to the 
greater accessibility of chromatin containing H2A.Z-nucleosomes at 
higher temperature. This suggests that in the absence of H2A.Z- 
nucleosomes, PIF4 should bind FT more strongly. We therefore 
compared the ability of PIF4 expressed under its own promoter to 
bind to the FT promoter in wild type and arp6-1, a background lacking 
incorporation of H2A.Z-nucleosomes. Interestingly, we observed 
considerably greater binding of PIF4 in arp6-1 (Fig. 3g), indicating 
that H2A.Z-nucleosomes are rate limiting for PIF4 to activate FT 
expression. The eviction of H2A.Z-nucleosomes by higher temper- 
ature therefore provides a direct mechanism for the temperature- 
regulated expression of FT (Fig. 4c). Consistent with our previous 
results and the established role of H2A.Z in regulating temperature- 
dependent gene expression, we found that there is increased PIF4 
messenger RNA in arp6-1 background (Supplementary Fig. 6). 
However, our results for 35S::PIF4 suppression by 12 °C indicate that 
transcriptional upregulation of PIF4 is not the rate-limiting step in 
regulating PIF4-mediated flowering at higher temperatures. 

Our results indicate that the temperature-dependent regulation of 
FT by PIF4 is controlled at the level of chromatin accessibility of the 
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Figure 4 | PIF4 integrates environmental signals. a, Suppression of 
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nucleosome dynamics can regulate PIF4 binding to target loci for 
transcriptional activation. 


FT promoter and possibly at the level of PIF4 protein activity. PIF4 
activity is controlled through the repressive activity of DELLA proteins 
that prevent PIF4 binding DNA”. 

Consistently, plants having reduced or absent DELLA function are 
early flowering’. We hypothesized that delay in flowering at lower 
temperatures might at least in part be due to DELLA-mediated repres- 
sion of PIF4 activity. If so, it would be expected that absence of 
DELLAs should cause accelerated flowering at lower temperatures. 
In accord with this expectation, we found that a mutant lacking 
DELLAs flowered much earlier than wild type when grown at 12 °C 
(Fig. 4a, b). The phytohormone gibberellin triggers DELLA protein 
degradation, and plays a key permissive role for FT induction, because 
in a gibberellin-deficient background, gibberellin application increases 
FT expression 15-fold™*. Although it was proposed more than 50 years 
ago that gibberellins are upstream of florigen”’, the mechanism has not 
been clear. As DELLA proteins have been shown to be key regulators 
by which gibberellin influences PIF4, our finding that PIF4 is able to 
activate FT directly suggests a possible mechanism by which changes 
in gibberellin levels may influence flowering. 

Climate change has already caused measurable changes in plant 
phenology and behaviour’, and plants that incorporate temperature 
information into their life cycles appear to be able to adapt to warmer 
conditions more effectively than those that primarily rely on photo- 
period to synchronize their lifestyles*. The importance of the effects of 
climate change on yield are highlighted by the significant detrimental 
effects of increasing temperatures on yield*. PIF4 is a central integrator 
of environmental information in the plant and our finding that it 
activates FT at higher temperatures suggests it will be a key node for 
breeding crops resilient to climate change. This importance is sug- 
gested by the recent discovery that natural variation at PIF4 plays a 
major role in key ecological traits”®. 
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METHODS SUMMARY 


Detailed descriptions of the plant growth conditions, growth assays, transgenic 
constructs and ChIP techniques are provided in the Supplementary materials. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 26 July 2010; accepted 6 February 2012. 
Published online 21 March 2012. 


1. Samach, A. & Wigge, P. A. Ambient temperature perception in plants. Curr. Opin. 
Plant Biol. 8, 483-486 (2005). 

2. Fitter, A. H. & Fitter, R.S. Rapid changes in flowering time in British plants. Science 
296, 1689-1691 (2002). 

3. Willis, C.G., Ruhfel, B., Primack, R. B., Miller-Rushing, A. J. & Davis, C. C. 
Phylogenetic patterns of species loss in Thoreau’s woods are driven by climate 
change. Proc. Nat! Acad. Sci. USA 105, 17029-17033 (2008). 

4. Battisti, D. S. & Naylor, R. L. Historical warnings of future food insecurity with 
unprecedented seasonal heat. Science 323, 240-244 (2009). 

5. Halliday, K. J., Salter, M. G., Thingnaes, E. & Whitelam, G.C. Phytochrome control of 
flowering is temperature sensitive and correlates with expression of the floral 
integrator FT. Plant J. 33, 875-885 (2003). 

6. Balasubramanian, S., Sureshkumar, S., Lempe, J. & Weigel, D. Potent induction of 
Arabidopsis thaliana flowering by elevated growth temperature. PLoS Genet. 2, 
e106 (2006). 

7. Blazquez, M.A., Ahn, J. H. & Weigel, D. A thermosensory pathway controlling 
flowering time in Arabidopsis thaliana. Nature Genet. 33, 168-171 (2003). 

8. Koini, M. A. etal. High temperature-mediated adaptations in plant architecture 
require the bHLH transcription factor PIF4. Curr. Biol. 19, 408-413 (2009). 

9. Stavang, J. A. et al. Hormonal regulation of temperature-induced growth in 
Arabidopsis. Plant J. 60, 589-601 (2009). 

10. Nozue, K. et a/. Rhythmic growth explained by coincidence between internal and 
external cues. Nature 448, 358-361 (2007). 

11. Valverde, F. etal. Photoreceptor regulation of CONSTANS protein in photoperiodic 
flowering. Science 303, 1003-1006 (2004). 

12. Lorrain, S., Allen, T., Duek, P. D., Whitelam, G. C. & Fankhauser, C. Phytochrome- 
mediated inhibition of shade avoidance involves degradation of growth- 
promoting bHLH transcription factors. Plant J. 53, 312-323 (2008). 

13. Wenkel, S. et al. CONSTANS and the CCAAT box binding complex share a 
functionally important domain and interact to regulate flowering of Arabidopsis. 
Plant Cell 18, 2971-2984 (2006). 

14. Foreman, J. et al. Light receptor action is critical for maintaining plant biomass at 
warm ambient temperatures. Plant J. 65, 441-452 (2011). 

15. Takada, S. & Goto, K. Terminal flower2, an Arabidopsis homolog of 
heterochromatin protein1, counteracts the activation of flowering locus T by 
constans in the vascular tissues of leaves to regulate flowering time. Plant Cel/ 15, 
2856-2865 (2003). 


4 | NATURE | VOL 000 | 00 MONTH 2012 


16. Adrian, J. et a/. cis-regulatory elements and chromatin state coordinately control 
temporal and spatial expression of FLOWERING LOCUST in Arabidopsis. Plant Cell 
22, 1425-1440 (2010). 

17. Liu, H. et al. Photoexcited CRY2 interacts with CIB1 to regulate transcription and 
floral initiation in Arabidopsis. Science 322, 1535-1539 (2008). 

18. Franklin, K. A. et al. PHYTOCHROME-INTERACTING FACTOR 4 (PIF4) regulates 
auxin biosynthesis at high temperature. Proc. Nat! Acad. Sci. USA 108, 
20231-20235 (2011). 

19. Angel, A., Song, J., Dean, C. & Howard, M. A Polycomb-based switch underlying 
quantitative epigenetic memory. Nature 476, 105-108 (2011). 

20. Kumar, S. V. & Wigge, P. A. H2A.Z-containing nucleosomes mediate the 
thermosensory response in Arabidopsis. Cell 140, 136-140 (2010). 

21. de Lucas, M. et al. A molecular framework for light and gibberellin control of cell 
elongation. Nature 451, 480-484 (2008). 

22. Feng, S. etal. Coordinated regulation of Arabidopsis thaliana development by light 
and gibberellins. Nature 451, 475-479 (2008). 

23. Mutasa-Gottgens, E. & Hedden, P. Gibberellin as a factor in floral regulatory 
networks. J. Exp. Bot. 60, 1979-1989 (2009). 

24. Hisamatsu, T. & King, R. W. The nature of floral signals in Arabidopsis. |l. Roles for 
FLOWERING LOCUS T (FT) and gibberellin. J. Exp. Bot. 59, 3821-3829 (2008). 

25. Brian, P. W. Role of gibberellin-like hormones in regulation of plant growth & 
flowering. Nature 181, 1122-1123 (1958). 

26. Brock, M. T., Maloof, J. N. & Weinig, C. Genes underlying quantitative variation in 
ecologically important traits: PIF4 (phytochrome interacting factor 4) is associated 
with variation in internode length, flowering time, and fruit set in Arabidopsis 
thaliana. Mol. Ecol. 19, 1187-1199 (2010). 


Supplementary Information is linked to the online version of the paper at 
www.nature.com/nature. 


Acknowledgements We thank S. Prat, C. Fankhauser, K. Franklin, K. Goto, G. Coupland 
and D. Weigel for seeds. We are grateful to members of the Wigge laboratory for 
discussions. This work was supported in part by award No. KUK-I1-002-03 (to N.P.H.) 
made by King Abdullah University of Science and Technology and a Biotechnology and 
Biological Sciences Research Council (BBSRC) grant BB/I019022/1 (to S.V.K.). D.L. 
was supported by an Erwin Schroedinger Fellowship from the Austrian Science Fund 
FWF. P.A.W. was supported by start-up funds from the John Innes Centre and BBSRC, a 
BBSRC grant (BB/D0100470/1) and a European Research Council Starting Grant 
(ERC 243140). 


Author Contributions S.V.K., D.L., K.EJ. and E.A. performed the experiments. N.P.H. and 
P.A.W. designed the study and supervised the work. All authors discussed the results 
and made substantial contributions to the manuscript. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of this article at 
www.nature.com/nature. Correspondence and requests for materials should be 
addressed to P.A.W. (philip.wigge@slcu.cam.ac.uk). 


©2012 Macmillan Publishers Limited. All rights reserved 


METHODS 


Plant material and growth conditions. All plant lines used were in Col-0 
background unless otherwise specified. pif4-101 mutant was provided by 
C. Fankhauser, HA-tagged 35S::PIF4 by S. Prat'*. All references to “35S::PIF# 
and ‘PIF4:HA’ refer to this line: that is, 35S::PIF4:HA. FT::GUS was obtained from 
K. Goto’. 35S::PIF4:HA co-9 was obtained by crossing. The crosses were 
genotyped for presence of the 35S::PIF4 construct by PCR on genomic DNA using 
primers 2362 and 2363, resulting in two products of different size representing the 
complementary DNA (cDNA) transgene and the genomic DNA fragment, 
respectively. co-9 was genotyped with primers 3650 and 3652 for insertion, and 
3291 and 3292 for the wild-type fragment. For genotyping phyb-9, DNA was 
amplified using oligonucleotides 2137 and 2138 followed by Mnll digestion to 
distinguish between wild-type and mutant alleles. The global della mutant is in the 
Ler background and was described previously*. PIF4::PIF4:ProteinA and 
PIF4::PIF4:GUS were constructed by amplifying the genomic fragment of PIF4 
including the promoter with oligonucleotides 1534 and 1535. The PCR product 
was cloned into pENTR/D-TOPO (Invitrogen) and inserted into the binary plas- 
mids PW889 (carboxy-terminal ProteinA) and PW395 (carboxy-terminal GUS), 
respectively, using Gateway technology (Invitrogen). Transgenic plants were 
obtained by transforming pif4-101 by floral dip. For hypocotyl measurements, 
seeds were surface sterilized, sown on '/ MS media, stratified for 2 days at 4°C 
in the dark and germinated for 24h at 22 °C. The plates were then transferred to 
short-day conditions (8/16 h photoperiod) at 22 and 27 °C respectively, and grown 
vertically for 10 days before being imaged and the hypocotyl length measured 
using the Image] software (http://rsbweb.nih.gov/ij/). Oligonucleotide sequences 
are provided in Supplementary Table 1. 

Transcript analysis. Samples from plants grown in long days (16/8h photo- 
period) were collected and total RNA was extracted using Trizol Reagent 
(Invitrogen). RNA (2 ug) was treated with Dnasel (Roche) and used for cDNA 
synthesis (First strand cDNA synthesis kit, Fermentas). cDNA was diluted 1:8 and 
used for qPCR with a Roche Lightcycler 480 and the corresponding Sybr Green 
master mix. To detect FT transcript levels, oligonucleotides 3180 and 3181 were 
used; for CO, oligonucleotides 2951 and 2952. PIF4 transcript levels were analysed 
using oligonucleotides 3952 and 3953. Oligonucleotides 3247 and 3408 amplifying 
TUB6 (At5g12250) were used for normalization. 

Immunoblot analysis. To analyse the possible effect of temperature on PIF4 
protein stability, plants overexpressing PIF4:HA (35S::PIF4:HA) were used. Seven- 
day-old 35S::PIF4:HA seedlings grown in short days at 17 °C were transferred to 12, 
17, 22 and 27 °C in short days for 2 days. Samples were collected at end of night and 
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thereafter 30 min, 1 h and 4h under illumination. Protein samples were separated 
by SDS-polyacrylamide gel electrophoresis and transferred on to nitrocellulose 
membrane. PIF4:HA was detected using HRP-conjugated HA antibody (Miltenyi 
Biotech) and visualized by chemiluminscent detection using Immobilon 
Chemiluminescent HRP substrate (Millipore). 

GUS histochemical assay. For GUS-staining, plants were grown on 2 MS plates 
in long days (16/8 h photoperiods) for 10 days and kept in the dark for 24 h before 
collecting. Plants were stained in buffer containing 100 mM phosphate buffer, pH 
7, 10mM EDTA, 0.1% Triton-X100, 0.5 mM K-ferrocyanide and 1 mM X-Gluc at 
37°C for 24h before de-staining in ethanol. 

ChIP. ChIP was performed as described”? with minor modifications. 
35S::PIF4:HA seedlings were grown on 7/2 MS plates for 10 days and kept in the 
dark for 24h at respective temperatures before collecting. Plant tissue (1.5 g) and 
4 ug of antibody (HA-tag antibody ab9110 from Abcam) were used for ChIP. To 
analyse the dynamics of PIF4::PIF4:ProteinA, plants were grown in respective 
temperatures under short-day conditions for 4 weeks. Aerial parts of the plants 
were collected and cross-linked before being used for chromatin preparations. 
ChIP was done using magnetic beads (Dynabeads M-270 Epoxy, Invitrogen) 
coated with rabbit IgG (Sigma, 15006) as described (http://www.ncdir.org/ 
protocols/Rout/Conjugation%200f%20Dynabeads.pdf). To analyse H2A.Z 
dynamics at the FT locus in response to temperature, we used 3-week-old seedlings 
of HTA11::HTA11:GFP grown at 17 and 27°C. ChIP was done using anti-GFP 
antibody (Abcam, ab290). To analyse PIF4 binding in Col-0 and arp6-1 back- 
grounds, respective genotypes with PIF4::PIF4:3XFLAG were grown on soil at 
22°C under short photoperiods for 3weeks before samples were fixed by 
formaldehyde cross-linking. ChIP was performed using anti-Flag M2 affinity gel 
(Sigma A2220). Immuno-complexes were eluted using 3x Flag peptide (Sigma 
F4799) according to the manufacturer’s instructions. Immunoprecipitated DNA 
was eluted after reverse cross-linking by boiling at 95 °C for 1 min in the presence 
of 10% Chelex (BioRad laboratories) followed by treatment with Proteinase 
K. Oligonucleotides 3255 and 3256 for FT-15 region, 3613 and 3614 for FT-cl, 
3607 and 3608 for FT-c and 3261 and 3262 for FT-f were used for detecting 
PIF4 binding to the FT locus. As a positive control for PIF4 binding, 
At5g45280 was analysed using oligonucleotides 2857 and 2958. HSP70 was used 
as a negative control using oligonucleotides 1862 and 1865. To analyse PIF4 
binding at At4g39950, oligonucleotides 4240 and 4241 was used for region 1, 
and oligonucleotides 4246 and 4247 were used for region 2. Oligonucleotides 
1860 and 1861 were used for HSP70 as a negative control. Oligonucleotide 
sequences are provided in Supplementary Table 1. 
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The conversion of chemical energy into mechanical force by AAA+ 
(ATPases associated with diverse cellular activities) ATPases is integ- 
ral to cellular processes, including DNA replication, protein unfold- 
ing, cargo transport and membrane fusion’. The AAA+ ATPase 
motor cytoplasmic dynein regulates ciliary trafficking’, mitotic 
spindle formation’ and organelle transport*, and dissecting its 
precise functions has been challenging because of its rapid timescale 
of action and the lack of cell-permeable, chemical modulators. Here 
we describe the discovery of ciliobrevins, the first specific small- 
molecule antagonists of cytoplasmic dynein. Ciliobrevins perturb 
protein trafficking within the primary cilium, leading to their 
malformation and Hedgehog signalling blockade. Ciliobrevins also 
prevent spindle pole focusing, kinetochore-microtubule attach- 
ment, melanosome aggregation and peroxisome motility in cultured 
cells. We further demonstrate the ability of ciliobrevins to block 
dynein-dependent microtubule gliding and ATPase activity in vitro. 
Ciliobrevins therefore will be useful reagents for studying cellular 
processes that require this microtubule motor and may guide the 
development of additional AAA+ ATPase superfamily inhibitors. 

The AAA+ superfamily of enzymes couples ATP hydrolysis with 
the generation of mechanical force to regulate diverse aspects of 
prokaryote and eukaryote biology’. ATP-dependent conformational 
changes can propagate through these molecular machines to complete 
cellular processes within seconds, and chemical inhibitors that act 
rapidly and reversibly are much-needed tools for investigating the 
cellular functions of individual superfamily members. Yet, so far, only 
one AAA+ ATPase mechanoenzyme has been selectively targeted by a 
small molecule’. 

Werecently conducted a high-throughput screen for inhibitors of the 
Hedgehog (Hh) pathway (Fig. la)°, a key mediator of embryonic 
development and oncogenesis’. Our study was designed to identify 
compounds that act downstream of Smoothened (Smo), a transmem- 
brane Hh signalling protein, and one of the small molecules, HPI-4 
(Fig. 1b; 1), blocked Hh pathway activation in cells lacking the negative 
regulator Suppressor of Fused (Sufu)°. Prolonged treatment of cells with 
this benzoyl dihydroquinazolinone also reduced the number and size of 
primary cilia, a microtubule-based extension of the plasma membrane 
that is required for Hh signalling®. Intrigued by these cellular pheno- 
types, we investigated the biochemical mechanism of HPI-4. 

We first synthesized a series of analogues (Fig. 1b; 2-9) and evaluated 
their effects on Hh signalling and primary cilia formation (Fig. 1c, dand 
Supplementary Figs 1-3). Chemical derivatives lacking either a 3- or 
4-chloro substituent on the benzoyl ring system (2 and 6) or the acyclic 
ketone (9) were significantly less active in either assay (Fig. 1b, d). The 
other small molecules segregated into 2,4-dichlorobenzoyl dihydro- 
quinazolinones that inhibit both Hh signalling and primary cilia forma- 
tion (1, 3-5), which we henceforth name ‘ciliobrevins A-D’, and 
monochlorobenzoyl analogues that can block Hh target gene expres- 
sion without inducing ciliary defects (7 and 8) (Fig. 1b, d). 


Hh signalling is primarily mediated by the transcription factors Gli2 
and Gli3, which exist in a pathway state-dependent balance of amino 
(N)-terminal repressors (Gli2/3R), full-length polypeptides (Gli2/3FL) 
and transcriptional activators (Gli2/3A) (Fig. 1a)’. Both repressor 
and activator formation require the primary cilium*, and accordingly 
ciliobrevins altered the Gli3FL/Gli3R ratio in cells stimulated with the 
N-terminal domain of Sonic Hedgehog (Shh-N) (Fig. le; 30 .M doses 
of each compound). Shh-N-dependent Gli3FL phosphorylation was 
also reduced by these compounds, perhaps reflecting loss of Gli3A’. In 
contrast, none of the other analogues had a significant effect on the 
Gli3 processing or phosphorylation state (Fig. le). 

To understand better the basis of these phenotypes, we took advantage 
of the temporal control afforded by chemical perturbations. Although 
prolonged exposure to these compounds causes defects in axonemal 
morphology, shorter treatments can divulge ciliobrevin-sensitive pro- 
cesses within structurally intact cilia. Because Hh pathway activation 
coincides with Gli2 accumulation at the distal ciliary tip'®, we examined 
the effect of ciliobrevins on Gli2 localization (Fig. 1f). We incubated Hh- 
responsive cells with individual compounds at a 30 [1M concentration in 
the absence or presence of Shh-N-conditioned medium for 4h. Gli2 
localization was unchanged by derivatives that do not significantly per- 
turb ciliogenesis (2 and 8), whereas ciliobrevins A and D (1 and 5) 
induced ciliary Gli2 levels comparable to that in Shh-N-stimulated cells. 

The ability of ciliobrevins to increase ciliary Gli2 levels suggests 
that these compounds might target protein trafficking mechanisms 
within this organelle. Intraflagellar transport (IFT) can be resolved into 
anterograde trafficking, which requires the plus-end-directed motor 
kinesin-2 and the IFTB multisubunit complex, and retrograde traffick- 
ing, which uses the minus-end-directed motor cytoplasmic dynein 2 
and the IFTA complex*. Loss of the primary cilia-specific cytoplasmic 
dynein 2 heavy chain (Dync2h1) alters cilia morphology", reduces 
Hh target gene expression’’ and increases ciliary levels of Gli2". 
Similarities between these genetic phenotypes and the effects of 
ciliobrevins led us to hypothesize that these small molecules might 
inhibit cytoplasmic dynein 2. We therefore examined the effect of 
ciliobrevins on the subcellular localization of IFTB component 
IFT88, which requires cytoplasmic dynein 2-dependent retrograde 
transport for its return to the basal body. Treating cells for 1 hour with 
ciliobrevin D (5) but not DMSO or an inactive analogue (2) signifi- 
cantly increased IFT88 levels at the distal tip of primary cilia (Sup- 
plementary Fig. 4), providing further evidence that ciliobrevins inhibit 
cytoplasmic dynein 2 function. 

Cytoplasmic dynein complexes have other cellular functions, includ- 
ing the crosslinking and focusing of microtubule minus ends within the 
mitotic spindle’. These actions create the fusiform shape and localize 
y-tubulin-containing complexes to the spindle poles*. Cytoplasmic 
dynein 1 inhibition by blocking antibodies or dominant-negative 
constructs perturbs spindle assembly, resulting in disorganized poles 
and reduced y-tubulin recruitment*'*". To determine whether 
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Figure 1 | Ciliobrevins disrupt primary cilium-dependent Gli regulation. 
a, Depiction of Hh signalling with positive (blue) and negative (red) regulators. 
b, Dihydroquinazolinone structures. ¢, Ciliary phenotypes after treatment with 
individual compounds (30 LM) for 24h. Scale bar, 10 um. d, Compound effects 
on Hh target gene expression in Shh-LIGHT2 cells” and cilia size in Shh-EGFP 
cells®. Half-maximum inhibitory concentration (ICs) values are the average of 
three independent experiments + s.d., and cilia size is defined as the average 
number of Arl13b-positive pixels per nuclei for ten micrographs ~ s.d., each 
containing approximately 150 cells. Small molecules that block Hh signalling 
alone (blue) and those that inhibit both Hh pathway activity and ciliogenesis 


ciliobrevins recapitulate these phenotypes, we treated a metaphase- 
enriched population of NIH-3T3 cells with 50 uM of either ciliobrevin 
D (5) or an inactive analogue (2) for 1 hour and examined their mitotic 
structures. Cells treated with ciliobrevin D showed abnormal (unfocused, 
multipolar or collapsed) spindles with disrupted y-tubulin localization 
(Fig. 2a, b and Supplementary Fig. 5a), whereas cells incubated with the 
non-cilia-disrupting analogue or vehicle alone showed normal spindle 
morphologies. Similar ciliobrevin-induced spindle defects were observed 
in HeLa cells, although to a lesser extent (Fig. 2b). Cytoplasmic dynein 1 
is also required for establishing stable kinetochore-microtubule 
interactions’®, and ciliobrevin D treatment disrupted the formation of 
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(ciliobrevins; red) are highlighted. e, Gli3FL and Gli3R levels in Shh-EGFP cells 
treated with Shh-N and either individual dihydroquinazolinones (30 1M), 3 uM 
cyclopamine or DMSO for 16h. Average Gli3FL/GIi3R ratios for five 
independent experiments + s.e.m. and a representative immunoblot are shown, 
with asterisks indicating P < 0.005 for individual compounds versus DMSO and 
cyclopamine (Cy). f, Ciliary Gli2 levels (a.u., arbitrary units) in Shh-EGFP cells 
treated with selected ciliobrevins (1 and 5), inactive analogues (2 and 8) or 
DMSO for 4 h. Average Gli2 levels in the distal end of at least 25 cilia + s.e.m. and 
representative confocal micrographs are shown. Asterisks indicate P < 0.005 for 
individual compounds versus DMSO. Scale bar, 2 um. 


cold-stable microtubules that mediate proper spindle-chromosome 
attachments (Fig. 2c). 

To investigate if these spindle-disruptive effects were associated 
with altered dynein localization, we examined binding partners that 
recruit or co-localize with this motor. Immunofluorescence micro- 
scopy showed that p150-Glued, a dynactin component proposed to 
recruit dynein to kinetochores and spindle poles during mitosis’®, was 
localized to the disorganized spindle poles of ciliobrevin D-treated cells 
(Fig. 2d and Supplementary Fig. 5b). Kinetochore targeting of p150- 
Glued was also unaffected by ciliobrevin D, as assessed in nocodazole- 
treated cells to obviate effects due to microtubule-attachment status 
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Figure 2 | Ciliobrevins disrupt spindle pole assembly and kinetochore- 
microtubule attachment. a, Mitotic spindles observed in NIH-3T3 cells 
treated with MG132 for 90 min and subsequently cultured with either an 
inactive analogue (2) or ciliobrevin D (5) at a 50 uM dose or DMSO for 1h. 
Staining for DNA, «-tubulin and y-tubulin is shown. b, Quantification of 
spindle phenotypes in NIH-3T3 and HeLa cells treated as described above and 
scored for either bipolar or abnormal morphologies. Data are the average of 
three independent experiments + s.e.m., each including at least 150 spindles. 
c, Kinetochore—-microtubule interactions analysed in metaphase-arrested NIH- 
3T3 cells treated with DMSO or 50 uM 5 and then incubated on ice for 10 min. 
Staining for DNA, the kinetochore marker CREST and «-tubulin is shown. 
Insets highlight individual kinetochore-microtubule attachments or 
untethered kinetochores (400% magnification). d, Localization of p150-Glued 
in metaphase-arrested NIH-3T3 cells treated with DMSO or 50 uM 5. Staining 
for DNA, a-tubulin and p150-Glued is shown. Scale bars: a, 4 Lum; ¢, d, 5 Lum. 


(Supplementary Fig. 6). We similarly observed that Zeste white 10 
(Zw10), a component of the Rod/Zw10/Zwilch complex that recruits 
cytoplasmic dynein 1 to kinetochores”’, and the kinetochore-associated 
protein Centromere protein E (CENP-E)’* correctly localized to kine- 
tochores under these conditions (Supplementary Fig. 6), indicating 
that their recruitment and the kinetochore structure itself are not dis- 
rupted by ciliobrevins. Thus, ciliobrevin-induced spindle phenotypes 
most likely result from dynein inhibition rather than mislocalization. 

To characterize further the mitotic defects associated with ciliobrevin 
treatment, we conducted real-time confocal microscopy of green fluor- 
escent protein (GFP)-tubulin-expressing NIH-3T3 cells (Supplemen- 
tary Figs 7, 8). Within minutes of ciliobrevin D treatment, spindles 
collapsed, bipolarity was lost and spindle poles appeared disorganized 
(Supplementary Fig. 7b). Upon compound washout, bipolar spindles 
quickly re-emerged and chromosomes segregated at anaphase without 
any pronounced defects (Supplementary Fig. 7c, d). Ciliobrevin D 
addition also reversibly disrupted the pre-formed spindles of 
metaphase-arrested cells (Supplementary Fig. 8b, c) and reduced over- 
all microtubule levels (Supplementary Fig. 8d). These latter effects are 
mitosis-specific, as microtubule levels in non-dividing cells were un- 
affected by inhibitor treatment (Supplementary Fig. 7e, f). Taken 
together, these results reveal that cytoplasmic dynein is not only 
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required for spindle pole assembly but also actively participates in its 
maintenance. 

Cytoplasmic dynein 1 also regulates organelle trafficking, such as 
the melatonin-induced aggregation of melanosomes in Xenopus 
melanophores”. To determine if melanosome trafficking is sensitive 
to ciliobrevins, we cultured Xenopus melanophores with melanocyte- 
stimulating hormone to disperse these pigment granules and then 
treated the cells with melatonin and various concentrations of 
ciliobrevin D (5) or an inactive analogue (2). Ciliobrevin D reversibly 
inhibited melanosome aggregation, but the non-cilia-disrupting 
derivative had no discernible effect at comparable doses (Fig. 3a, b 
and Supplementary Movies 1-3). Ciliobrevin D similarly abrogated 
the movement of peroxisomes in Drosophila S2 cells (Fig. 3c-e and 
Supplementary Movies 4 and 5), consistent with the role of cytoplasmic 
dynein in their bidirectional motility”. 

Collectively, these results indicate that ciliobrevins are specific, 
reversible inhibitors of disparate cytoplasmic dynein-dependent pro- 
cesses. Ciliobrevins do not perturb cellular mechanisms that are inde- 
pendent of dynein function, including actin cytoskeleton organization 
(Supplementary Fig. 9) and the mitogen-activated protein kinase and 
phosphoinositol-3-kinase signalling pathways’. To examine more 
directly whether cytoplasmic dynein is the direct target of ciliobrevins, 
we evaluated their effects on dynein-dependent microtubule gliding 
in vitro. Ciliobrevins A and D (1 and 5) retarded the ATP-dependent 
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Figure 3 | Ciliobrevins inhibit melanosome aggregation and peroxisome 
motility. a, Brightfield images of Xenopus melanophores treated with various 
concentrations of ciliobrevin D (5) or an inactive derivative (2) for 10 min, 
stimulated with melatonin in the presence of the compounds for 30 min, and 
then paraformaldehyde-fixed. b, Melanophores treated with 5 as before, washed 
in medium containing melatonin alone, and then imaged. Scale bars, 100 jim. 
c, d, Motility of GFP-labelled peroxisomes in Drosophila 82 cells cultured in the 
absence (c) or presence (d) of 5. Overlays of videomicroscopy frames at t = 0s 
(red) and t = 10s (green) are shown. Scale bar, 5 um. e, Vector distributions for 
the GFP-labelled peroxisomes with movement towards and away from the cell 
centre denoted by negative and positive bin values, respectively. 
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movement of fluorescently labelled microtubules on bovine cytoplasmic 
dynein-coated glass slides in a reversible and dose-dependent manner 
(Fig. 4a—c, Supplementary Fig. 10 and Supplementary Movies 6-8); 
analogues that did not perturb cytoplasmic dynein-dependent processes 
in our cell-based assays (2 and 8) had minimal effects (Fig. 4a, b and 
Supplementary Movie 9). The conserved structure-activity relation- 
ships of dihydroquinazolinones in the microtubule gliding and cell- 
based assays confirm cytoplasmic dynein as the ciliobrevin target, 
and neither ciliobrevin A nor D significantly affected K560/kinesin- 
1-dependent microtubule gliding in vitro at 100 uM concentrations 
(Fig. 4d and Supplementary Movies 10-12). The compounds do not 
broadly target members of the AAA+ ATPase family either, as they 
have no effect on p97-dependent degradation of endoplasmic-reticulum- 
associated proteins (Supplementary Fig. 11) or Mcm2-7-mediated 
DNA unwinding (Supplementary Fig. 12). 
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Figure 4 | Ciliobrevins inhibit cytoplasmic dynein-dependent microtubule 
gliding and ATPase activity. a, Montages of fluorescent microtubules moving 
on bovine dynein-coated glass slides in the presence of ATP and either DMSO, 
ciliobrevin A (1), ciliobrevin D (5) or non-cilia-disrupting analogues (2 and 
8). All compounds were tested at a 100 1M concentration and the vertically 
stacked images were acquired 2 s apart. Scale bar, 10 um. b, Quantification of 
the compounds’ effects on dynein-dependent microtubule gliding. Data are the 
average velocities for at least 56 microtubules + s.e.m. Asterisks indicate 

P<10 °and at least 30% inhibition for individual compounds versus DMSO. 
c, Dose responses of 1 and 5 in the dynein-dependent assay. Data are the 
average velocities for at least 45 microtubules + s.e.m. d, Effects of 1, 2, 5,8 and 
the competitive ATP antagonist adenylyl imidodiphosphate (A-P) on 
microtubule gliding driven by the K560 N-terminal fragment of kinesin-1. 
Dihydroquinazolinones and AMP-PNP were tested at a 100 1M and 1 mM 
concentrations, respectively. Data are the average velocities for at least 

34 microtubules + s.e.m., and asterisks indicate P < 0.01 for individual 
compounds versus DMSO. e, Effects of 1, 2 and 5 on the ATPase activities of 
recombinant motor domains derived from rat cytoplasmic dynein, human 
kinesin-1 and human kinesin-5. Compound concentrations in micromolar 
units are shown, and data are the average ATPase activities for two independent 
experiments + s.d. f, Hanes—Woolf plot of rat dynein motor ATPase velocity 
(V) suggesting the nucleotide-competitive activity of 5. 
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We next investigated how ciliobrevins abrogate cytoplasmic dynein 
function. Neither ciliobrevin A nor D was able to disrupt the association 
between ADP-bound dynein and microtubules, as determined in a 
co-sedimentation assay (Supplementary Fig. 13). Both small molecules, 
however, were able to inhibit the ATPase activity of bovine brain 
cytoplasmic dynein in a concentration-dependent manner, whereas 
their inactive analogue (2) could not (Supplementary Fig. 14). The 
compounds had analogous effects on the ATPase activity of recom- 
binant rat dynein motor domain, but none significantly inhibited the 
ATPase activities of recombinant motor domains derived from human 
kinesin-1 or kinesin-5 (Fig. 4e). Ciliobrevin efficacies at various ATP 
concentrations suggest that these small molecules act in a nucleotide- 
competitive manner (Fig. 4f). Consistent with this mechanism, 
ciliobrevin A also inhibited ultraviolet-light-induced cleavage of the 
cytoplasmic dynein motor domain in the presence of sodium vanadate 
and ATP (Supplementary Fig. 15). 

Our studies establish ciliobrevins as the first small molecules known 
specifically to inhibit cytoplasmic dynein in vitro and in live cells. 
Although the ATP analogue erythro-9-[3-(2-hydroxynonyl)]adenine”" 
and the antioxidant nordihydroguaiaretic acid” have been previ- 
ously reported to abrogate dynein function, these compounds are 
promiscuous enzyme antagonists****. The natural product purealin 
can partly inhibit dynein ATPase activity in vitro”, but its ability to 
block cytoplasmic dynein-dependent cellular processes has not been 
demonstrated. Our studies indicate that ciliobrevins can inhibit both 
cytoplasmic dynein 1 and 2, and, accordingly, the compounds will be 
broadly applicable probes of dynein-dependent processes. Further 
development of ciliobrevin-like molecules could lead to isoform- 
selective inhibitors of this minus-end-directed microtubule motor 
and perhaps specific antagonists of other AAA+ ATPase superfamily 
members. 


METHODS SUMMARY 

Hh signalling assays. Hh signalling and Gli3 processing assays were performed as 
described*’. 

Cell imaging. Cilia were immunostained with anti-Arl13b antibody (T. Caspary), 
and cilia size was determined by dividing the number of Arl13b-positive pixels by 
the number of nuclei per image. Ciliary levels of Gli2 and IFT88 were determined by 
immunostaining cells with anti-Gli2 (R&D Systems) or anti-IFT88 (ProteinTech 
Group) antibodies. Analyses of mitotic spindles and kinetochore-microtubule 
attachments in fixed cells were performed as described”’, and real-time confocal 
microscopy was conducted with NIH-3T3 cells stably expressing GFP-tubulin. 
Melanosome and peroxisome motility studies were conducted as reported'*”°. 
Dynein activity assays. Bovine brain dynein was purified as described”, and its 
microtubule gliding activity was assayed as reported’* with modifications. 
Recombinant rat cytoplasmic dynein motor domain was heterologously expressed 
and purified as described”. ATPase activities were determined using a Malachite 
Green assay (Novus Biologicals). 

Full Methods are in the Supplementary Information. 
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Targeted therapies have demonstrated efficacy against specific subsets 
of molecularly defined cancers'*. Although most patients with lung 
cancer are stratified according to a single oncogenic driver, cancers 
harbouring identical activating genetic mutations show large varia- 
tions in their responses to the same targeted therapy’’. The biology 
underlying this heterogeneity is not well understood, and the impact 
of co-existing genetic mutations, especially the loss of tumour sup- 
pressors’’, has not been fully explored. Here we use genetically 
engineered mouse models to conduct a ‘co-clinical’ trial that mirrors 
an ongoing human clinical trial in patients with KRAS-mutant 
lung cancers. This trial aims to determine if the MEK inhibitor 
selumetinib (AZD6244)"° increases the efficacy of docetaxel, a 
standard of care chemotherapy. Our studies demonstrate that con- 
comitant loss of either p53 (also known as Tp53) or Lkb1 (also 
known as Stk1 1), two clinically relevant tumour suppressors®”""”, 
markedly impaired the response of Kras-mutant cancers to 
docetaxel monotherapy. We observed that the addition of 
selumetinib provided substantial benefit for mice with lung cancer 
caused by Kras and Kras and p53 mutations, but mice with Kras 
and Lkb1 mutations had primary resistance to this combination 
therapy. Pharmacodynamic studies, including positron-emission 
tomography (PET) and computed tomography (CT), identified 
biological markers in mice and patients that provide a rationale 
for the differential efficacy of these therapies in the different 
genotypes. These co-clinical results identify predictive genetic 
biomarkers that should be validated by interrogating samples from 
patients enrolled on the concurrent clinical trial. These studies also 
highlight the rationale for synchronous co-clinical trials, not only 
to anticipate the results of ongoing human clinical trials, but also to 
generate clinically relevant hypotheses that can inform the analysis 
and design of human studies. 

Activating KRAS mutations are found in 15-30% of all patients with 
non-small cell lung cancer (NSCLC), and predict poor outcome in 
response to conventional treatment regimens’**. Preclinical studies 
have suggested that inhibition of MAPK/ERK kinase (MEK) may be 
effective against KRAS-mutant NSCLC’, prompting an ongoing 


human clinical trial comparing docetaxel monotherapy (standard of 
care) to docetaxel combined with the MEK inhibitor selumetinib 
(AZD6244). Although the sole genetic entry criteria for patients on 
this trial is the presence of KRAS mutations, the complexity of NSCLC 
dictates that many tumours will harbour concomitant genetic altera- 
tions that may modulate response to therapy. To mirror this human 
clinical trial in a murine co-clinical trial, and to investigate the modu- 
lating effects of concomitant tumour suppressor loss, we generated 
cohorts of genetically engineered mice with either Kras, Kras and 
p53 (Kras/p53) or Kras/Lkb1 mutant lung cancers. Activation of 
Kras(G12D) and inactivation of p53 or Lkb1 in the lung epithelium 
was achieved using nasal instillation of adenovirus encoding the CRE 
recombinase’. Mice with established disease, defined by tachypnoea, 
hypoxaemia on pulse oximetry’®, and bulk disease on magnetic res- 
onance imaging (MRI)'’, were randomized to receive either docetaxel 
16 mgkg ' every other day by intraperitoneal injection’* (Supplemen- 
tary Table 1), selumetinib at 25mgkg ' daily by oral gavage'®, or 
docetaxel in combination with selumetinib. Treatment response was 
determined by serial MRI. Tumour volumes were reconstructed from 
the MRI images (Supplementary Fig. 1a) with a high level of inter- 
operator reliability (Supplementary Fig. 1b; 95% confidence interval, 
—25.6% to +31.4%). On the basis of these performance metrics, and 
paralleling human response criteria, we used a threshold of 30% 
change in tumour volume to define progressive disease and partial 
response. 

For tumours with only Kras mutation, treatment with docetaxel 
monotherapy resulted in a modest rate of response, with 30% of mice 
achieving a partial response (Fig. 1a, c). Mice bearing Kras tumours with 
concurrent loss of p53 or Lkb1 had markedly lower response rates to 
docetaxel treatment (5% and 0%, respectively), and more of these ani- 
mals demonstrated progressive disease on MRI or progression to mor- 
ibundity (Fig. la and Supplementary Table 2). The addition of 
selumetinib to docetaxel treatment provided substantial benefit 
(Fig. 1b, c), with the overall response rate increased to 92% in Kras- 
mutant cancers (P = 2.8 X 10°, Fisher exact test compared to docetaxel 
alone) and 61% in Kras/p53 mice (P = 2.7 X 10 “*). In contrast, for 
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Kras/Lkb1 mutant cancers the addition of selumetinib to docetaxel led 
to only a modest improvement in overall response, with 33% of the 
mice achieving a partial response (Fig. 1b, c). Compared to the other 
genotypes, Kras/Lkb1 mice had a significantly lower rate of response to 
combined treatment with selumetinib and docetaxel (P = 0.0009, 
3 X 2 contingency Fisher exact test). 

The magnitude of change in volume confirmed that tumours with 
Kras or Kras/p53 mutations were considerably more responsive to 
combination therapy compared to docetaxel alone. In contrast, the 
addition of selumetinib did not significantly reduce the volume of 
tumours with compound Kras/Lkb1 mutations (Fig. 1d and Sup- 
plementary Fig. 1c, d). Concordantly, histopathological assessments 
of tumours collected after two doses of treatment revealed that the 
combination treatment increased apoptosis (Fig. le and Supplemen- 
tary Fig. 2a) and reduced proliferation (Fig. 1fand Supplementary Fig. 2b) 
in the Kras and Kras/p53 tumours compared to docetaxel alone, but 
this was not observed in Kras/Lkb1 tumours (Fig. le, fand Supplemen- 
tary Fig. 2a, b). These results demonstrate that combined treatment 
with selumetinib and docetaxel induces apoptosis and decreases pro- 
liferation in Kras and Kras/p53 tumours, leading to antitumour effi- 
cacy, but that concomitant mutation of Lkb1 confers primary 
resistance to the combination treatment. 

Because repeated tumour biopsies are difficult in patients, we explored 
the use of '“F-fluoro-2-deoxy-glucose PET (FDG-PET) as an early 
response indicator that could be used in the clinic. Comparison of 
FDG avidity, quantified by standardized uptake value (SUV)*°” in lung 
cancers across the three different genotypes showed an overall higher 
FDG uptake in both Kras/p53 and Kras/Lkb1 tumours compared to 
Kras tumours (Fig. 2a; P = 0.02, one-way ANOVA). Expression of the 
glucose transporter GLUT1 (also known as SLC2A1) was elevated in 
Kras/Lkb1 mutant tumours (Supplementary Fig. 3a), consistent with 
the increased baseline FDG-PET signal. To determine if this finding 
was applicable to human patients, we determined the pre-treatment 


2 | NATURE | VOL 000 | 00 MONTH 2012 


FDG avidity in nine patients with KRAS-mutated lung cancer. 
Tumours from three patients positive for LKB1 immunostaining had 
a mean maximum SUV (SUV,,,,) of 2.33, whereas tumours from six 
patients negative for LKB] immunostaining had a mean SUV max Of 8.75 
(Fig. 2b; P = 0.048, two-sided Wilcoxon). 

We next used FDG-PET to assess early tumour metabolic changes 
after initiation of therapy. Treatment with docetaxel alone did not result 
in significant changes in tumour hypermetabolism in Kras-, Kras/p53- 
or Kras/Lkb1-tumour-bearing mice (Fig. 2c, d). Of note, some of the 
murine Kras lung cancer nodules were not FDG avid (Fig. 2a) and these 
were the most sensitive to single-agent docetaxel (data not shown). In 
contrast, within 24h of the first dose of treatment with docetaxel and 
selumetinib, tumour hypermetabolism was markedly suppressed in 
both Kras and Kras/p53 mice (Fig. 2c, d). However, Kras/Lkb1-mutant 
tumours had no appreciable decrease in FDG avidity when treated with 
the combination (Fig. 2c, d). Together, these results demonstrate that 
early changes in tumour metabolism measured by FDG-PET (Fig. 2c, 
d) are concordant with histopathological analysis of apoptosis and 
proliferation (Fig. le, f) and predict antitumour efficacy (Fig. la-c) 
of docetaxel and selumetinib in treating Kras-mutant lung cancers. 

To assess the pharmacodynamic effects of treatment on the MEK- 
ERK signalling axis, we assayed pathway activation using phospho- 
ERK immunostaining of lung cancer nodules. At baseline, the ERK 
pathway was most activated in Kras/p53-mutant tumours (Fig. 3a, b). 
We observed substantially less phospho-ERK staining in Kras/Lkb1 
tumours, suggesting that the MEK-ERK pathway is not highly activated 
in these cancers. Treatment with docetaxel did not alter phospho-ERK 
staining, but, as expected, the addition of selumetinib decreased MEK- 
ERK signalling in the Kras and Kras/p53 tumours (Fig. 3a, b). 

We further evaluated cellular signalling from short-term-treated 
lung cancer nodules by immunoblotting tumour lysates. Concordant 
with immunostaining (Fig. 3a), elevated phospho-ERK and phospho- 
9ORSK were observed in Kras/p53 tumours relative to the other genotypes 
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Figure 2 | FDG-PET predicts treatment 
response. a, FDG-PET signal intensity (SUVimax) 
in Kras, Kras/p53 and Kras/Lkb1 mutant mice. 


Kras 


Statistical significance determined by rank sum 
test, with *P < 0.05 for Kras compared to Kras/p53 
mutant mice (P = 0.019), and Kras compared to 
Kras/Lkb1 mutant mice (P = 0.014). b, FDG-PET 


Kras/p53 Kras/Lkb1 é ; ty , : 
signal intensity in patients with KRAS or KRAS/ 


Kras 


ry 
Kras/p53 Kras/Lkb1 KRAS KRAS/LKB1 


d 
xo) 
8 
o 
ro) 
fe) 
(a) 
an. Cs. 2. sf 
‘O Baseline 24-h post-Rx Baseline 24-h post-Rx 
c= Kras Kras/p53 
£ 
a #6359 . W 43305 YE, 
n > 
+ Wad 
xo) 
x 
i] 
8 
[e) ‘\ oc « . ‘ 
7 Baseline 24-h post-Rx Baseline 24-h post-Rx 


(Fig. 3c and Supplementary Fig. 4a). Kras/Lkb1 tumours displayed low 
basal activation of the MEK-ERK pathway (Fig. 3c and Supplementary 
Fig. 4a), consistent with immunostaining (Fig. 3a). Treatment with 
docetaxel had no discernable impact on the MEK-ERK pathway in 
any genotype (Fig. 3a-c and Supplementary Fig. 4b). Although 
selumetinib alone resulted in decreased phospho-ERK, residual activity 
was still present (Fig. 3c and Supplementary Fig. 4b). Treatment with 
both docetaxel and selumetinib more effectively eradicated phospho- 
ERK activity (Fig. 3c and Supplementary Fig. 4b). Pharmacokinetic 
studies suggested that selumetinib levels were elevated in the serum 
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and tumours of mice treated with selumetinib combined with docetaxel 
compared to selumetinib alone (Supplementary Table 3), perhaps pro- 
viding a mechanism for the more potent suppression of MEK-ERK 
signalling by the combination (Fig. 3c). The potential relevance of these 
findings to human disease was investigated by assessing phospho-ERK 
staining in a set of 57 human NSCLC tumour samples with known 
KRAS, p53 and LKB1 mutation status. Consistent with our findings in 
murine tumours, of seven patients harbouring the KRAS activating 
mutation, the three patients with concurrent p53 loss showed higher 
phospho-ERK activity (Fig. 3d). 
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The decreased activation of ERK phosphorylation in Kras/Lkb1 
tumours suggests that the proliferation of these tumours may be driven 
through other signalling pathways. On the basis of our prior studies”, 
we investigated the activity of AKT and SRC in Kras/Lkb1 mutant 
tumours. Immunoblotting with activation-state-specific antibodies 
revealed that Kras/Lkb1-mutant tumours have heightened activation 
of both AKT and SRC (Supplementary Fig. 3a, b), consistent with the 
finding of increased FDG avidity in Kras/Lkb1 tumours (Fig. 2a, b), 
because PI3K regulates expression of GLUT1 (Supplementary Fig. 3a). 
These results suggest that concomitant mutation of Kras and Lkb1 may 
alter the signalling circuitry in tumour cells from one dependent on 
MEK-ERK (in Kras and Kras/p53 tumours) to one that has more active 
AKT and SRC pathways, resulting in primary resistance to docetaxel 
and selumetinib. 

The concurrent human clinical trial does not include a treatment 
arm in which patients are treated with selumetinib alone, based on lack 
of efficacy in a phase II clinical trial in patients with NSCLC”, and on 
our preclinical data in Kras genetically engineered mice”’. In mice with 
Kras, Kras/p53 and Kras/Lbk1 tumours, treatment with selumetinib 
alone resulted in a heterogeneous reduction in FDG-PET uptake 
(Supplementary Fig. 5a), consistent with pharmacodynamic evidence 
that selumetinib alone partially attenuates MEK-ERK signalling 
within tumours (Fig. 3c and Supplementary Fig. 7c). However, no 
partial responses were achieved in any genotype with selumetinib 
monotherapy, although there was attenuation of tumour growth 
compared to untreated controls (Supplementary Fig. 5b). Together, 
these data suggest that selumitinib as monotherapy modulates MEK- 
ERK signalling in Kras-driven tumours, but is insufficient for clinical 
benefit in mice and humans. 

We determined the long-term benefit of combined treatment with 
docetaxel and selumetinib in the Kras- and Kras/p53-mutant mice com- 
pared to chronic treatment with docetaxel monotherapy. We did not 
assess long-term treatment outcome in Kras/Lkb1 animals given the 
primary resistance to both treatments in these animals (Figs 1-3). In 
mice with Kras tumours, treatment with docetaxel alone stabilized 
disease for several weeks, whereas the addition of selumetinib caused 
frank tumour regression and slower tumour re-growth (Fig. 4a and 
Supplementary Fig. 6a, b). Accordingly, the addition of selumetinib to 
docetaxel significantly prolonged progression-free survival (Fig. 4b). In 
mice with Kras/p53 tumours, treatment with docetaxel alone largely 
resulted in progressive disease, whereas animals treated with a combina- 
tion of docetaxel and selumetinib had initial disease regression before 
progression (Fig. 4a and Supplementary Fig. 6c), resulting in prolonged 
progression-free survival (Fig. 4c). These results demonstrate that the 
enhanced response to treatment with combined therapy translates to 
improved progression-free survival, albeit not outright cure, in mice 
bearing Kras- and Kras/p53-mutant tumours. 

To investigate mechanisms of resistance upon disease progression, 
tumour nodules were isolated from moribund animals after long-term 
treatment with docetaxel and selumetinib. In all animals examined (5/ 
5 in Kras/p53 and 11/11 in Kras), tumour nodules showed recrudes- 
cence of ERK phosphorylation (Fig. 4d and Supplementary Fig. 7a), 
suggesting that acquired resistance could be partly due to reactivation 
of MEK-ERK signalling despite ongoing treatment with selumetinib. 
Weevaluated treatment-resistant nodules for ERK amplification (Sup- 
plementary Fig. 7b), activation of parallel signalling pathways (Sup- 
plementary Fig. 7c), and drug pharmacokinetics (Supplementary 
Fig. 7d), and did not find consistent changes, suggesting more than 
one mechanism for pathway reactivation. Efforts to identify the diversity 
of mechanisms responsible for acquired resistance are ongoing. 

This co-clinical study provides several insights and predictions that 
affect the interpretation of the concurrent human clinical trial. First, 
these results predict that combination therapy with docetaxel and selu- 
metinib will be more effective than docetaxel alone in several sub- 
classes of KRAS-mutant NSCLC. These data are consistent with the 
results of the human phase II clinical trial described in a recent 
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Figure 4 | Long-term treatment outcome in Kras and Kras/p53 mice. 

a, Tumour volume was longitudinally assessed by MRI imaging in Kras and 
Kras/p53 mice treated with either docetaxel or docetaxel plus selumetinib. Data 
points represent median tumour volume relative to start of treatment for all 
available data at the indicated time point. b, Progression-free survival for Kras 
mice treated with either docetaxel or docetaxel plus selumetinib. Median 
survival for single and combination treatments was 6 weeks and 12 weeks 
respectively, with ***P = 0.0003 by log-rank test. c, Progression-free survival 
for Kras/p53 mice treated with either docetaxel or docetaxel plus selumetinib. 
Median survival for single and combination treatments was 2 weeks and 4 
weeks, respectively, with ***P < 0.0001 by log-rank test. Progression was 
defined as the time point when total tumour volume exceeded the baseline 
volume. d, Immunostaining of activation-specific phospho-ERK of tumours 
from Kras/p53 and Kras mice with acquired resistance to docetaxel and 
selumetinib treatment. 


press release (http://phx.corporate-ir.net/phoenix.zhtml?c= 123810&p= 
irol-newsArticle&ID= 1611800). However, our studies predict that con- 
current mutation of LKB1 will confer primary resistance to combination 
therapy, possibly through activation of parallel signalling pathways such as 
AKT and SRC. As LKB] status is not being prospectively assessed in the 
ongoing human clinical trial, inclusion of patients with cancers harbouring 
concurrent LKBI mutations may blunt differences between treatment 
arms based solely on KRAS status. These results suggest that a retrospective 
analysis of p53 and LKB1 status in samples from the concurrent human 
clinical trial is warranted, and lead us to advocate prospective analysis in 
future clinical trials with sufficient enrolment to all strata to enable suffi- 
ciently powered sub-group analyses. 

Beyond assessing genetic modulators, co-clinical studies allow for 
validation of biomarker strategies and discovery of mechanisms of res- 
istance that may benefit future clinical trials. In this study, we observed the 
potential utility of FDG-PET imaging as a biomarker strategy for iden- 
tifying an enriched responder population and predicting long-term out- 
come. Prior studies have suggested that lung tumour hypermetabolism at 
the time of diagnosis predicts poor outcome in response to conventional 
therapies”, our current study further suggests that high baseline FDG 
avidity may predict poor response to targeted therapy with selumetinib 
combined with docetaxel. Specifically, loss of LKB1 function appears to 
confer increased FDG avidity, probably through upregulated expres- 
sion of glucose transporters. As current approaches for assessing LKB1 
status are not comprehensive, FDG-PET imaging may represent a 
practicable patient stratification strategy. Furthermore, the current 
preclinical study suggests a role for repeat FDG-PET imaging early 
in the course of treatment as a potential predictor of outcome, as 
metabolic changes may be apparent within 24h of initiating therapy. 
In these studies, we also observed reactivation of the MEK-ERK 
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signalling pathway in mice that became resistant to the combination of 
selumetinib and docetaxel. Although the exact mechanisms respons- 
ible for pathway reactivation remain to be elucidated, mechanisms of 
resistance discovered in co-clinical studies should be confirmed in 
human clinical trials by examining biopsy samples from patients 
who relapse on therapy. The ability to assess mechanisms of resistance 
in the preclinical setting may uncover rational combinatorial strategies 
that can be implemented in future clinical studies. 

Building upon prior success using genetically engineered mouse 
models**’’, the current study demonstrates that co-clinical trials can 
provide data that has value beyond predicting the outcome of clinical 
trials, and can rapidly generate new clinically relevant hypotheses that 
can affect how the concurrent human clinical trial is analysed, and 
inform the design of future clinical studies. As similar efforts are under- 
taken in other cancer disease types, we anticipate that murine co- 
clinical trials will enable more effective oncology drug development. 


METHODS SUMMARY 


Mice. Mouse strains harbouring a conditional activating mutation (G12D) at the 
endogenous Kras locus, conditional Lkb1 knockout, and conditional p53 knockout 
were described previously’. Genotypes were confirmed by PCR (Supplementary 
Fig. 8). All studies were performed on protocols approved by Dana-Farber Cancer 
Institute and University of North Carolina Animal Care and Use Committees, and 
all mice used are listed in Supplementary Table 4. 

MRI quantification. 3D Slicer was used to reconstruct MRI volumetric measure- 
ments**”” (Supplementary Fig. la). To assess variation between independent 
operators, Bland-Altman analysis was performed using quantification results 
from the two operators on a total of 16 MRI scan images (Supplementary Fig. 1b). 
Pharmacokinetics. Docetaxel concentrations in serum, lung, and tumour nodules 
was determined using published doses (8 and 16mgkg_'). Drug concentration 
was determined 3h after the last dose (Supplementary Table 1). All serum con- 
centrations were within the range found in the clinical setting’. Selumetinib was 
administered as previously described’, and pharmacokinetics in mice were also 
documented (Supplementary Table 3). 

PET/CT studies. All murine FDG-PET/CT studies were performed with a pre- 
clinical small animal PET/CT system (Siemens Inveon) after injection with 
14MBg of '8F-FDG. Mice used for PET/CT studies are listed in Supplementary 
Table 5. 

Human samples and clinical information. All human samples and clinical 
information were obtained under Institutional Review Board approved protocols 
(02-180 and 07-0120), and patient information is listed in Supplementary Tables 7 
and 8. A tissue microarray (TMA) was generated from genotyped human lung 
cancer samples as previously described’. The TMA was immunostained for 
phospho-ERK and scored by a pathologist blinded to patient information. 
Immunohistochemistry staining. Immunohistochemical analyses assessing 
phospho-ERK, activated caspase 3, and Ki-67 were performed as previously 
described’. Scoring was done by a pathologist, using the same parameters used 
for scoring human specimens. 
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Opioid receptors mediate the actions of endogenous and exogenous opioids on many physiological processes, including the 
regulation of pain, respiratory drive, mood, and—in the case of «- opioid receptor (x-OR)—dysphoria and psychotomimesis. 
Here we report the crystal structure of the human k- OR in complex with the selective antagonist JDTic, arranged in parallel 
dimers, at 2.9 A resolution. The structure reveals important features of the ligand-binding pocket that contribute to the high 
affinity and subtype selectivity of JDTic for the human «-OR. Modelling of other important x-OR-selective ligands, 
including the morphinan-derived antagonists norbinaltorphimine and 5’-guanidinonaltrindole, and the diterpene 
agonist salvinorin A analogue RB-64, reveals both common and distinct features for binding these diverse chemotypes. 
Analysis of site-directed mutagenesis and ligand structure-activity relationships confirms the interactions observed in the 
crystal structure, thereby providing a molecular explanation for k-OR subtype selectivity, and essential insights for the 
design of compounds with new pharmacological properties targeting the human x-OR. 


The four opioid receptors, [1, 5, « and the nociceptin/orphanin FQ 
peptide receptor, belong to the class A (rhodopsin-like) y subfamily of 
G-protein-coupled receptors (GPCRs)'’ with a common seven- 
transmembrane helical architecture, and are coupled predominantly 
to heterotrimeric G,/G, proteins. Activation of these receptors by 
endogenous or exogenous ligands is linked to a number of neuro- 
psychiatric sequelae, including analgesia, sedation, depression, 
dysphoria and euphoria’. The three closely related subtypes, t-OR, 
5-OR and k-OR, share ~70% sequence identity in their seven trans- 
membrane helices (I-VII), with more variations in the extracellular 
loops (ECLs) and very little similarity in their amino and carboxy 
termini’. The majority of endogenous opioid peptides have a defined 
preference for specific subtypes, for example, endorphins act via 
5-ORs and l-ORs, whereas dynorphins preferentially activate k-ORs. 
However, most exogenous and synthetic opioid ligands interact 
promiscuously (see the K; Database; http://pdsp.med.unc.edu/pdsp. 
php), probably owing to the high degree of similarity among binding 
pockets of opioid receptors. Although decades of focused medicinal 
chemistry efforts have yielded reasonably selective ligands for all four 
ORs (see the K, Database), there remains substantial interest in the 
development of subtype-selective agonists and antagonists. 

Recent breakthroughs in elucidating high-resolution structures of 
GPCRs in complex with small-molecule*”’ and peptide® ligands are 
providing details of their function’, leading to numerous rational 
ligand discovery studies’®''. However, whereas most of these struc- 
tures belong to the « subfamily of class A GPCRs’, the highly diverse 
peptide-binding y subfamily is represented only by the CXCR4 
chemokine receptor*®; additional structural coverage is needed to 
elucidate the repertoire of features’* that define the pharmacological 
profile of this subfamily. The k-OR, identified based on studies with 
the k-type prototypic agonist ketocyclazocine’’, represents an attrac- 
tive target for structure determination. Several «-OR-selective 
partial agonists and antagonists have been developed as potential 


antidepressants, anxiolytics and anti-addiction medications", whereas 
a widely abused, naturally occurring hallucinogen—salvinorin A 
(SalA)—was also found to be a highly selective k-OR agonist’. 
Although many «-OR agonists and antagonists have not demonstrated 
desirable pharmacological properties, lacking specificity or displaying 
frank psychotomimetic actions in humans'*’*, some have been shown 
to be viable drug candidates. A «-OR ligand in early stages of 
clinical development, JDTic (3R)-1,2,3,4-tetrahydro-7-hydroxy-N- 
[(1S)-1-[[(3R,4R)-4-(3-hydroxyphenyl)-3,4-dimethyl-1-piperidinyl] 
methyl]-2-methylpropyl]-3-isoquinolinecarboxamide), was originally 
designed as a novel selective k-OR antagonist’” that blocks the k-OR 
agonist U50,488-induced antinociception, while not antagonizing U1-OR 
agonist-induced analgesia’®. JDTic also displays robust activity in 
rodent models of depression, anxiety, stress-induced cocaine relapse, 
and nicotine withdrawal'*’’. Here we report the crystal structure of a 
human k-OR construct, k-OR-T4 lysozyme (T4L), in complex with 
JDTic at 2.9 A resolution. The results provide structural insights into 
the atomic details of molecular recognition and subtype selectivity of the 
«-ORand related ORs, and should catalyse the structure-based design of 
advanced human k-OR agonists and antagonists with improved phar- 
macological profiles and enhanced therapeutic efficacies. 


Overall architecture of the x-OR 


Structural studies were carried out using an engineered human k-OR 
construct (see Methods and Supplementary Fig. 1) and crystallized in 
cholesterol-doped monoolein lipidic cubic mesophase (see Methods). 
The construct used showed pharmacological behaviour similar to that 
of a native receptor expressed in HEK293T cells (Supplementary 
Tables 2 and 3). Data collection and refinement statistics are shown 
in Supplementary Table 1. 

The structure of «-OR-JDTic was determined at 2.9 A in the 
P2,2,2, space group. The asymmetric unit consists of two receptors 
forming a parallel dimer (Fig. 1a). The dimer interface with ~ 1,100 A? 
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Figure 1 | Crystal packing and overview of the human k-OR structure in 
complex with JDTic, and comparison with the inactive CXCR4 and B,-AR 
structures. a, k-OR-T4L crystal packing. The parallel dimer in one 
asymmetric unit is highlighted by the insert. b, Overall architecture of k-OR- 
TAL in complex with JDTic. The A molecule (yellow) and B molecule (blue) 
from one asymmetric unit are aligned through the receptor part. The DRY and 


buried surface area is formed through contacts among helices I, Il and 
VIII (Fig. la, insert). Previously, parallel receptor dimers have been 
identified in crystal structures of activated rhodopsin (involving helices 
I, Il and VII)”, the B, adrenergic receptor (B,-AR; cholesterol 
mediated)’ and CXCR4 (involving helices IV, V and VI)*. Consistent 
with these crystallographic data, recent biochemical studies have sug- 
gested the existence of two dimerization interfaces: along helices IV and 
V (sensitive to receptor activation) and along helix I (insensitive to the 
state of activation)”". Although the orientations of the two T4L copies in 
the receptor monomers in one asymmetric unit differ by ~60° rotation, 
both copies of the receptor are highly similar (Fig. 1b) and will be 
treated identically except where otherwise noted. 

The main fold of the human «-OR consists of a canonical seven- 
transmembrane bundle of «-helices followed by an intracellular helix 
VIH that runs parallel to the membrane (Fig. 1a, b), resembling previ- 
ously solved GPCR structures**. Structural comparison with other 
GPCRs suggests that human k-OR has marked similarities in the 
ECL region with CXCR4, another peptide-binding receptor in the y 
subfamily. In the seven-transmembrane region, however, the k-OR 
structure is closer to aminergic receptors belonging to the subfamily 
(alpha carbon root mean squared deviation (r.m.s.d.) ~2.3 A for the 
B.-AR, ~1.9 A for the dopamine D3 receptor (D3R) and ~2.7A for 
CXCR4). The structure reveals distinctive features of the human k-OR, 
including the following. First, conformation of the extracellular end of 
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NPXXY motifs are highlighted in red and blue, respectively. JDTic is shown ina 
green sphere representation and the disulphide bond is coloured orange. 

c, d, Side (c) and extracellular (d) views of a structural alignment of the human 
k-OR (yellow); CXCR4 (PDB accession 30DU; magenta) and B,-AR (PDB 
accession 2RH1; cyan). The graphics were created by PyMOL. 


helix I deviates from the position observed in CXCR4, where the tip of 
helix I is pulled towards the transmembrane bundle by a disulphide 
bond between the N terminus and ECL3. Second, ECL2, the largest 
extracellular loop of the human k-OR, forms a $-hairpin similar to that 
observed in CXCR4, despite the low sequence similarity in this domain 
between the two receptors. Conservation of this feature between these 
peptide receptors suggests that the §-hairpin could be acommon motif 
in the ECL2 of other y subfamily receptors, where interactions between 
ECL2 and their endogenous peptide ligands are deemed important for 
ligand recognition and selectivity”. Third, unlike other solved non- 
rhodopsin class A GPCRs that have more than one disulphide bond, 
the human «-OR has only one formed between Cys 131°? (super- 
scripts indicate residue numbering using the Ballesteros—-Weinstein 
nomenclature”) and Cys 210, bridging ECL2 to the end of helix III. 
These two cysteines are conserved in all opioid receptors and this 
disulphide bond is the canonical one shared by most other solved class 
A GPCRs. Fourth, intracellular loop 2 (ICL2) adopts slightly different 
structures in the two k-OR molecules in the asymmetric unit, involving 
a two-turn o-helix in molecule B, and only a one-turn o-helix in 
molecule A (Supplementary Fig. 2), possibly reflecting the conforma- 
tional plasticity of this region®. Last, ECL3 of the k-OR is disordered. 
Of the approximately 11 residues in this loop (residues 300-310), 6 
residues in molecule A and 3 in molecule B do not have interpretable 
electron density. 
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A common feature of the class A GPCRs is the presence of a con- 
served sequence motif Asp/Glu**?-Arg*°°-Tyr*?? (D/ERY) located at 
the cytoplasmic end of helix III. A salt bridge interaction between 
Arg’? and Asp/Glu°*? from the cytoplasmic end of helix VI consti- 
tutes an ‘ionic lock’, which is thought to stabilize the inactive con- 
formation of rhodopsin and other rhodopsin-like class A GPCRs*”*, 
whereas its absence can enhance constitutive activity®”’. Although the 
human k-OR lacks either of the acidic residues Asp/Glu at position 
6.30, Arg 156°°° forms a hydrogen bond to another helix VI residue, 
Thr 273°** (Supplementary Fig. 3a) in this inactive k-OR structure, 
thereby conceivably stabilizing the inactive receptor conformation. 
The NPXXY motif located at the cytoplasmic side of helix VIL which 
is composed of Asn 326’, Pro 327’, Ile 32871, Leu3297°? and 
Tyr 330”? in the k-OR, is another highly conserved functional motif 
that is proposed to act as one of the molecular switches responsible for 
class A GPCR activation”®. Comparison of the human «-OR with 
inactive B.-AR and Az, adenosine receptor (Az,AR) structures 
(Supplementary Fig. 3b) reveals a similar conformation of this motif 
in these receptors, thereby supporting the hypothesis that the 
observed k-OR-JDTic complex structure corresponds to the inactive 
state. To establish further that JDTic stabilizes an inactive conforma- 
tion, we evaluated its ability to modulate G,/G,-mediated and 
B-arrestin-mediated signalling in transfected HEK293T cells. We 
found that JDTic was devoid of agonist activity at both canonical 
and non-canonical pathways and completely blocked the effects of 
the prototypic agonist U69593 (Supplementary Fig. 4). 


The k-OR ligand-binding pocket 

The «-OR ligand-binding pocket displays a unique combination of 
key characteristics both shared with and distinct from those in the 
chemokine and aminergic receptor families. Although the human 
«-OR binding pocket is comparatively large and partially capped by 
the ECL2 B-hairpin, as in CKCR4, it is also much narrower and deeper 
than in CXCR4 (Fig. 2c, d and Supplementary Fig. 5). In addition to a 
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different set of side chains lining the pocket, the shape differences 
result from an approximately 4.5 A inward shift of the extracellular 
tip of helix VI in the k-OR as compared to CXCR4. The electron 
density clearly shows the position of the JDTic ligand (Supplemen- 
tary Fig. 6), which reaches deep into the pocket to form ionic inter- 
actions with the Asp 138°*? side chain (Fig. 2a). The Asp*** residue is 
conserved in all aminergic GPCRs, thereby having a critical role in the 
selectivity of aminergic receptors towards protonated amine-containing 
ligands. Likewise, Asp*** is conserved in all opioid receptors, and 
modelling and mutagenesis studies” suggest that it has an essential 
role in anchoring positively charged k-OR ligands. 


Structural basis of JDTic selectivity 


JDTic, developed as a derivative of the trans-(3R,4R)-4-(3- 
hydroxyphenyl)-3,4-dimethyl-1-piperidine scaffold”, has exceptionally 
high affinity (K;j=0.32nM), potency (K,j=0.02 nM in GTPyS 
assays)'’**, long duration of action and a more than 1,000-fold selec- 
tivity for the human k-OR as compared to other opioid receptor sub- 
types**. Extensive structure-activity relationship (SAR) analyses 
performed on JDTic analogues have yielded important insights into 
key determinants of JDTic activity”, although reliable identification 
of the interaction mode(s) and contact residues of these ligands has not 
been feasible without a receptor crystal structure. 

The crystal structure of k-OR-JDTic shows a tight fit of the ligand 
in the bottom of the binding cleft (Fig. 2a), forming ionic, polar and 
extensive hydrophobic interactions with the receptor (Fig. 2b). The 
protonated amines in both piperidine and isoquinoline moieties of the 
ligand form salt bridges to the Asp 138°” side chain (3.0 and 2.8A 
nitrogen-oxygen for molecule A, and 2.7 and 2.3 A for molecule B, 
respectively). The piperidine amine is part of the original trans- 
(3R,4R)-dimethyl-4-(3-hydroxyphenyl)piperidine scaffold and is 
essential for opioid receptor antagonist activity’. SAR studies of 
JDTic analogues show that the isoquinoline nitrogen can be replaced 
by carbon, oxygen or sulphur atoms with only a ~10- to 50-fold 


Figure 2 | Binding of the high-affinity selective 
antagonist JDTic in the human k-OR crystal 
structure. a, Conformation of the binding pocket 
with JDTic shown by sticks with yellow carbons. 
The protein is displayed in cartoon representation 
looking down from the extracellular side, with the 
22 contact residues within 4.5 A from the ligand 
shown by white sticks. The pocket surface is shown 
as a semitransparent surface coloured according to 
binding properties (green: hydrophobic; blue: 
hydrogen-bond donor; red: hydrogen-bond 
acceptor). Salt bridges and hydrogen bonds are 
shown as dotted lines. Structured water molecules 
are shown as large magenta spheres. b, Diagram of 
ligand interactions in the binding pocket side 
chains at 4.5 A cut-off. Salt bridges are shown in red 
and direct hydrogen bonds in blue dashed lines. 
Ballesteros—-Weinstein numbering is shown as 
superscript. Residues that vary among the p-OR, 
5-OR and k-OR subtypes are highlighted in cyan, 
and residue Asp 138°” implicated in k-OR-ligand 
binding by mutagenesis data, is highlighted orange. 
c-e, Side views of the sliced binding pocket in 
k-OR-JDTic (c), CXCR4-IT1t (d) and B)-AR- 
carazolol (e) complexes. The pocket surfaces are 
coloured as in panel a, the protein interior is black 
and the extracellular space is white. Ligands are 
shown as capped sticks with carbons coloured 
yellow (JDTic), magenta (IT1t) and cyan 
(carazolol). Asp? side chains in «-OR-JDTic and 
B-AR-carazolol complexes are shown by thin 
sticks with grey carbons. The graphics were 
prepared using the ICM molecular modelling 
package (Molsoft LLC). 
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reduction in affinity”®. Similar to the observed JDTic conformation in 
the k-OR-JDTic complex, a V-shaped conformation was found in the 
small molecule X-ray crystal structure of JDTic, which showed its 
amino groups coordinating a water molecule (Supplementary Fig. 
7a). Although several rotatable bonds within the JDTic molecule allow 
for the sampling of different conformations (see Supplementary Fig. 
7b) and facilitate the ligand passage through the narrow binding 
pocket entrance, the anchoring-type interaction of two amino groups 
with Asp 138°°* probably fixes the ligand in this characteristic V 
shape. 

SAR studies have also underscored the importance of the distal 
hydroxyl groups on both the piperidine and isoquinoline moieties 
of JDTic, the removal of which did result in about a 100-fold reduction 
of affinity. A much smaller effect was observed upon methylation of 
these hydroxyls or their replacement by other polar groups”*. These 
SAR results suggest the importance of water-mediated interactions 
between these two hydroxyl groups and the receptor. Indeed, 
although the crystal structure does not show direct hydrogen bonding 
with the receptor for both hydroxyl groups, there is clear electron 
density for several structured water molecules that mediate their polar 
interactions (Supplementary Fig. 6). 

The k-OR structure provides important clues for understanding the 
structural basis of the exceptional subtype selectivity of JDTic. Among 
many extensive contacts, JDTic interacts with four residues in the 
binding pocket that differ in other closely related opioid receptors, 
which are thought to contribute to the subtype selectivity of JDTic 
and other k-OR-selective ligands** (human [!-OR and 6-OR amino 
acids are shown in parentheses, respectively): Val 1087? (Ala and Ala), 
Val 118°? (Asn and Lys), Ile 294°°° (Val and Val) and ‘Tyrs12”” 
(Trp and Leu) (Fig. 2b and Supplementary Fig. 8). Analysis of JDTic 
binding into «-OR-based f/-OR and 5-OR homology models, as well 
as JDTic SAR results'’’**? (Supplementary Fig. 9), suggest that all 
described residues can contribute to the JDTic selectivity profile. 
Thus, changes in the Val 118”°° side chain, where larger hydrophilic 
residues Asn”? and Lys”? are found in the human pt-OR and 6-OR, 
respectively, are likely to introduce unfavourable contacts with JDTic. 
Additionally, changing Tyr 312’*° to the Trp’? and Leu’? residues 
found in the human p-OR and 6-OR, respectively, is likely to result in 
the loss of an important polar interaction with the JDTic amide. The 
remaining two hydrophobic side-chain replacements, Val to Ala at 
position 2.53 and Ile to Val at position 6.55, may cause a reduction 
of the hydrophobic contact between JDTic and the receptor. 

The isopropyl group of JDTic reaches deep into the orthosteric 
pocket to form a hydrophobic interaction with a conserved 
Trp 287°" side chain, possibly having a critical role in the pharmaco- 
logical properties of this ligand. Trp°** is thought to be a key part of 
the activation mechanism in many class A GPCRs, including rhodop- 
sin”® and the A,,AR”, and similar hydrophobic contacts have been 
implicated in blocking activation-related conformational changes in 
the dark state visual rhodopsin by 11-cis retinal, and by inverse ago- 
nists in the A,,AR and D3R. 


Binding of x-OR-selective morphinans 

Prior mutagenesis and modelling studies suggested that many small- 
molecule opioid ligands can interact with the k-OR, as well as with the 
ut-OR and 5-OR, by forming a salt bridge with the highly conserved 
Asp**? (refs 33, 34). This is consistent with our mutagenesis studies 
(Supplementary Table 3) and flexible docking” of a series of morphine 
analogues, including selective k-OR antagonists norbinaltorphimine 
(nor-BNI) and 5’-guanidinonaltrindole (GNTI) (Fig. 3 and Sup- 
plementary Fig. 10). To assess the compatibility of these bulky and rigid 
ligands with the observed k-OR protein backbone conformation, we 
performed global energy optimizations of nor-BNI and GNTI in the 
binding cavity of k-OR, keeping side chains of the binding pocket fully 
flexible. Multiple independent runs consistently resulted in low energy 
conformations with essentially identical poses and receptor contacts 
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Figure 3 | Putative interaction modes of morphine-based high-affinity 
k-OR-selective antagonists nor-BNI and GNTI. a, b, Interaction modes of 
nor-BNI (a) and GNTI (b). Ligands are depicted as capped sticks with green 
carbons, and contact side chains of the receptor within 4 A from the ligand are 
shown with grey carbons. Key hydrogen bonds and salt bridges are indicated 
with small cyan spheres and residues unique to the k-OR are labelled in blue. 
Residue Asp 138°”, which also shows critical impact on GNTI and nor-BNI 
binding in mutagenesis studies, is highlighted in red. Ballesteros—-Weinstein 
residue numbers are shown under the k-OR residue numbers. The graphics 
were prepared using the ICM molecular modelling package (Molsoft LLC). 


for the common naltrexone moieties of both nor-BNI and GNTI 
(r.m.s.d.0.85 A). In addition to a highly complementary van der 
Waals interface, both compounds formed an amino group salt bridge 
to the Asp 138°”? side chain and a hydrogen bond to the Tyr 139°” side 
chain, both of which are important anchoring points for binding of 
morphine-based ligands, as supported by previous mutagenesis 
studies™. 

Moreover, unlike JDTic, both nor-BNI and GNTI compounds have 
a second basic moiety located more than 10 A away from the first 
amino group (the second morphine moiety in nor-BNI and the 
guanidine moiety in GNTI). In the predicted models of k-OR- 
nor-BNI/GNTI complexes, these additional amino groups of both 
ligands form a salt bridge with Glu 297°°* located at the entrance to 
the ligand-binding pocket, which was previously characterized as a 
residue critical for subtype selectivity of k-OR-selective morphinan 
derivatives**. This interaction is also supported by our mutagenesis 
results (Supplementary Table 3), where a Glu297Ala mutation 
induced a significant drop in both nor-BNI and GNTI binding affinity, 
but did not affect JDTic affinity. Hydrophobic interactions at the 
«-OR-specific residue Ile 294 were also found for both nor-BNI and 
GNTI; consistent with our mutagenesis results (Supplementary 
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Figure 4 | Model of covalently bound RB-64. a, b, Putative binding mode of 
the RB-64 +463 AMU (a) and the RB-64 +431 amu (b) adduct. Residues within 
4A of the ligand are shown. Ligand, capped sticks/cyan carbons; K-OR side 
chains, capped sticks; hydrogen bonds, small green spheres; k-OR-unique 


Table 3) and suggesting that Ile294 may also be important for 
developing human «-OR-subtype-selective morphinan derivatives. 
Additional polar interactions with «-OR-specific residues, Glu 209 
and Ser211 in ECL2, are found for nor-BNI, which may further 
enhance the k-OR selectivity of this bulky ligand. Another side chain 
of the pocket, His 291°”, which is involved in the highly conserved 
aromatic cluster around Trp°** and thought to have a critical role in 
the receptor activation process’’, forms hydrophobic contacts with 
JDTic, nor-BNI and GNTI. His 291°? can be mutated to another 
aromatic residue, phenylalanine, without disrupting binding of 
these antagonists (Supplementary Table 3). The non-conservative 
His291°°’Lys mutation, however, abolished binding of all tested 
ligands, probably because of the disruption of the aromatic cluster 
induced by the lysine side chain. Interestingly, the cyclopropyl moiety 
of both nor-BNI and GNTI in these binding poses has the same 
position as the isopropyl moiety of JDTic, making hydrophobic contact 
with the conserved residue Trp 287°**. This cyclopropyl moiety is 
generally implicated in conversion of opioid agonists into antagonists 
(for example, agonist oxymorphone into antagonist naltrexone), and 
this effect may be partially explained by a direct interaction with the 
Trp 287°** side chain. 

Overall, these structure-based docking results support the ‘message- 
address’ model”* for morphine-based ligands nor-BNI and GNTI*, 
which points to Glu297°°* as a key side chain that controls k-OR 
selectivity by anchoring the ‘address’ moieties of these compounds. 
The crystal structure of the k-OR-JDTic complex (Figs 2 and 3), 
however, demonstrates that even without an ‘address’ interaction with 
Glu 297°°*, more than a 1,000-fold subtype-selectivity to k-OR can be 
achieved for JDTic and some of its derivatives. Importantly, then, the 
message-address hypothesis does not uniformly apply to all k-OR- 
selective antagonists. 


Binding of salvinorins 

SalA, a naturally occurring diterpene from the widely abused 
hallucinogenic plant Salvia divinorum, represents an exceedingly 
potent (half-maximum effective concentration (ECs9) = 1nM) and 
selective k-OR agonist (>1,000-fold)’*. SalA is unique compared to 
other k-OR ligands in that it lacks a charged or polar nitrogen atom to 
anchor it in the binding pocket. Extensive site-directed mutagenesis, 
substituted cysteine-accessibility mutagenesis (SCAM) and SAR studies 
on SalA and its analogues have been performed, indicating (among 
others) that the 2-acetoxy moiety interacts with Cys 315’** (ref. 39). 
Possible modes of interaction between the cysteine-reactive and ultra- 
potent agonist and SalA analogue 22-thiocyanatosalvinorin A (RB-64; 
K, = 0.59nM; ECs9 = 0.077 nM)*’, and the human «-OR structure 
were thus evaluated. Exposure of k-OR to RB-64 produces irreversibly 
bound, wash-resistant adducts that are tethered to Cys 3 15’°8 (ref. 39). 
As the thiocyanate group contains two electrophilic centres, two 


residues are labelled in blue. Ballesteros—Weinstein residue numbers are shown 
under the k-OR residue numbers. The graphics were prepared using the ICM 
molecular modelling package (Molsoft LLC). 


distinct adducts may be formed, increasing the mass by either 463 or 
431 amu. Docking studies using GOLD® predict that the salvinorin 
2-position can access Cys 315”°° while maintaining many of the inter- 
actions implicated by site-directed mutagenesis for SalA, providing a 
possible mechanism for the formation of the k-OR-RB-64 adduct 
(Fig. 4, Supplementary Tables 4 and 5, and Supplementary Figs 11 
and 12). Additionally, the docking results serve as a model of the initial 
recognition process of SalA-related agonists of the human k-OR in an 
inactive state, although additional studies will be needed to fully 
elucidate the nature of the SalA-induced activation mechanism. 


Conclusions 


The k-OR-JDTic crystal structure has uncovered a combination of key 
features shared with chemokine and aminergic GPCRs along with unique 
structural details characteristic of the opioid subfamily. The human k-OR 
was crystallized as a parallel dimer with contacts involving helices I, land 
VIII. Although the existence of GPCR dimers in vivo and their physio- 
logical relevance remain highly debatable, several distinct potential dimer 
interfaces are starting to emerge from crystallographic and biochemical 
studies. Such multiple dimerization interfaces may serve to support dif- 
ferent functional pathways, as well as to promote oligomeric assembly of 
GPCRs. Analysis of ligand-receptor interactions has revealed important 
molecular details of the exceptionally high affinity and subtype selectivity 
of JDTic, a small-molecule antagonist with a broad therapeutic potential. 
The elucidation of a large binding cavity with a multitude of potential 
anchoring points begins to explain both the broad structural diversity of 
drugs targeting the human k-OR and differences in their receptor inter- 
action modes, as supported by differential effects of various site-directed 
mutations on the binding properties of chemically diverse prototypic 
ligands. The human «-OR structure provides a long anticipated 
molecular framework for understanding opioid drug action, and thereby 
affords valuable new opportunities for the structure-based discovery of 
new drugs with ideal pharmacological properties. 


METHODS SUMMARY 


k-OR-TA4L was expressed in Spodoptera frugiperda (Sf9) cells. Ligand-binding and 
functional assays were performed as described in Methods. Receptor-ligand com- 
plexes were solubilized from washed Sf9 membranes using 1% (w/v) n-dodecyl-f-p- 
maltopyranoside (DDM) and 0.2% (w/v) cholesteryl hemisuccinate (CHS), and 
purified by immobilized metal ion affinity chromatography (IMAC), followed by 
reverse IMAC after cleaving N-terminal Flag-10 His tags by His-tagged tobacco 
etch virus (TEV) protease. The purified protein solution was mixed with monoolein 
and cholesterol in a ratio of 40%:54%:6% (w/w) to form lipidic cubic phase (LCP) 
from which the receptor was crystallized. Crystals were grown at 20°C in 45nl 
protein-laden LCP boluses overlaid by 800 nl of precipitant solutions as described 
in Methods. Crystals were harvested from the LCP matrix and flash frozen in liquid 
nitrogen. X-ray diffraction data were collected on the 23ID-B/D beamline (GM/CA 
CAT) at the Advanced Photon Source, Argonne, using a 101m minibeam at a 
wavelength of 1.0330 A. Data collection, processing, structure solution and refine- 
ment are described in Methods. Modelling of JDTic analogues and k-OR-selective 
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morphine derivatives nor-BNI and GNTI was performed using ICM-Pro; SYBYL- 
X 1.3 and GOLD Suite 5.1 were used to model RB-64 complexes, as described in 
Methods. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 

Protein engineering for structural studies. Human «-OR was engineered for 
structural studies by fusing lysozyme from T4 phage (T4L) into ICL3 (Gly 261- 
Arg 263) and further modified by N/C-terminal truncations (AGlu2-Ala42, 
AArg359-Val380) and a single point mutation Ilel35°7°Leu (see Sup- 
plementary Information). The resulting k-OR-T4L construct was subsequently 
expressed in baculovirus-infected Spodoptera frugiperda (Sf9) insect cells. 
Generation of k-OR constructs for Sf9 expression. The human «-OR (I135L) 
cDNA provided by the NIMH Psychoactive Drug Screening Program was cloned 
into a modified pFastBacl vector (Invitrogen), designated as pFastBac1-833100, 
which contained an expression cassette with a haemagglutinin (HA) signal 
sequence followed by a Flag tag, a 10 His tag*, and a TEV protease recognition 
site at the N terminus before the receptor sequence. Subcloning into the 
pFastBacl-833100 was achieved using PCR with primer pairs encoding restric- 
tion sites BamHI at the 5’ and HindIII at the 3’ termini of k-OR wild type with 
subsequent ligation into the corresponding restriction sites found in the vector. 

The k-OR-T4L gene, based on the human k-OR (I135L) and cysteine-free 
lysozyme from bacteriophage T4 (T4L C54T, C97A) sequences"’, included the 
following additional features: (1) residue Ser 262 at ICL3 of k-OR was deleted by 
using standard QuickChange PCR; (2) Asn2-Tyr161 of T4L were inserted 
between Gly 261 and Arg 263 within the ICL3 region; and (3) N-terminal residues 
2-42 and C-terminal residues 359-380 of k-OR were truncated. 

Expression and purification of K-OR constructs. High-titre recombinant 
baculovirus (>10° viral particles per ml) was obtained using the Bac-to-Bac 
Baculovirus Expression System (Invitrogen) as previously described**. 25 1M 
of the antagonist naltrexone (NTX) and 5% Protein Boost Additive (PBA) were 
added to the system during expression. Cell suspensions were incubated for 4 
days while shaking at 27°C. Production of high-titre baculovirus stocks was 
performed as described before**. Sf9 cells at a cell density of 2-3 X 10° cells ml“! 
were infected with P2 virus at a m.o.i. (multiplicity of infection) of 2. Cells were 
harvested by centrifugation at 48 h post-infection and stored at —80 °C until use. 

Insect cell membranes were disrupted by thawing frozen cell pellets in a hypo- 
tonic buffer containing 10 mM HEPES, pH 7.5, 10 mM MgCl, 20mM KCl and 
EDTA-free complete protease inhibitor cocktail tablets (Roche). Extensive wash- 
ing of the raw membranes was performed by repeated centrifugation in the same 
hypotonic buffer (two to three times), and then in a high osmotic buffer contain- 
ing 1.0 M NaCl, 10 mM HEPES, pH 7.5, 10 mM MgCl, 20 mM KCl and EDTA- 
free complete protease inhibitor cocktail tablets (three to four times), thereby 
separating soluble and membrane associated proteins from integral transmem- 
brane proteins. 

Washed membranes were resuspended into buffer containing 40 uM NTX, 
2mgml' iodoacetamide, 150mM NaCl and EDTA-free complete protease 
inhibitor cocktail tablets, and incubated at 4°C for 1h before solubilization. 
The membranes were then solubilized in 50mM HEPES, pH 7.5, 150mM 
NaCl, 1% (w/v) n-dodecyl-f-p-maltopyranoside (DDM, Anatrace), 0.2% (w/v) 
cholesteryl hemisuccinate (CHS, Sigma) and 20 WM NTX for 3h at 4°C. The 
supernatant was isolated by centrifugation at 160,000g for 40 min, and incubated 
in 30mM buffered imidazole (pH 7.5), 1M NaCl with TALON IMAC resin 
(Clontech) overnight at 4 °C. After binding, the resin was washed with 10 column 
volumes of Wash I Buffer (50 mM HEPES, pH 7.5, 800mM NaCl, 10% (v/v) 
glycerol, 0.1% (w/v) DDM, 0.02% (w/v) CHS, 10mM ATP, 10mM MgCl, and 
50 UM JDTic), followed by 6 column volumes of Wash II Buffer (50 mM HEPES, 
pH 7.5, 500 mM NaCl, 10% (v/v) glycerol, 0.05% (w/v) DDM, 0.01% (w/v) CHS, 
50 mM imidazole and 50 .M JDTic). The protein was then eluted by 3 column 
volumes of Elution Buffer (50 mM HEPES, pH 7.5, 300mM NaCl, 10% (v/v) 
glycerol, 0.03% (w/v) DDM, 0.006% (w/v) CHS, 250 mM imidazole and 50 1M 
JDTic). PD MiniTrap G-25 column (GE healthcare) was used to remove imidazole. 
The protein was then treated overnight with His-tagged AcTEV protease 
(Invitrogen) to cleave the N-terminal His-tag and Flag-tag. ACTEV protease and 
cleaved N-terminal fragment were removed by TALON IMAC resin incubation at 
4°C for 2h for binding. The tag-less protein was collected as the TALON 
IMAC column flow-through. The protein was then concentrated to 40 mg ml! 
with a 100 kDa molecular weight cut-off Vivaspin centrifuge concentrator (GE 
healthcare). Protein purity and monodispersity were tested by SDS-PAGE and 
analytical size-exclusion chromatography (aSEC). Typically, the protein purity 
exceeded 95%, and the aSEC profile showed a single peak, indicative of receptor 
monodispersity. 

Lipidic cubic phase crystallization. Protein samples of k-OR in complex with 
JDTic were reconstituted into lipidic cubic phase (LCP) by mixing with molten 
lipid in a mechanical syringe mixer‘. LCP crystallization trials were performed 
using an NT8-LCP crystallization robot (Formulatrix) as previously described®. 
96-well glass sandwich plates (Marienfeld) were incubated and imaged at 20 °C 
using an automated incubator/imager (RockImager 1000, Formulatrix). Initial 
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crystal hits were found from precipitant condition containing 100mM 
sodium citrate pH 6.0, 30% (v/v) PEG400, 400 mM potassium nitrate. After 
extensive optimization, crystals of 301m X 10 tm X 5m to 60 ptm X 20 um 
Xx 101m size were obtained in 100mM sodium citrate pH 5.8-6.4, 28-32% 
(v/v) PEG400, 350-450 mM potassium nitrate. Crystals were harvested directly 
from LCP matrix using MiTeGen micromounts and flash frozen in liquid 
nitrogen. 

Data collection, structure solution and refinement. X-ray data were collected at 
the 23ID-B/D beamline (GM/CA CAT) at the Advanced Photon Source, 
Argonne, using a 10j1m minibeam at a wavelength of 1.0330A and a 
MarMosaic 300 CCD detector. Most crystals were invisible after flash freezing 
in liquid nitrogen, and a similar alignment and data-collection strategy was 
followed as previously described**. Among the several hundred crystal samples 
screened, most crystals diffracted to 2.8-3.5 A resolution when exposed to 1-5 s of 
unattenuated beam using 1° oscillation. Data collection was limited to 5-10 
frames per crystal, due to the fast onset of radiation damage in the microcrystals. 
Data were integrated, scaled and merged using HKL2000*. A 97% complete data 
set of k-OR-T4L/JDTic (space group P2;2)2)) at 2.9 A resolution was obtained by 
merging data collected from 60 crystals. Initial phase information was obtained 
by molecular replacement with the program PHASER“ using two independent 
search models of the polyalanine seven-transmembrane o-helices of CXCR4- 
IT1t (PDB accession 30DU) and ensemble T4L models of B.-AR-T4L (PDB 
accession 2RH1), Az;,AR-T4L (PDB accession 3EML), CXCR4-T4L (PDB 
accession 30DU), D3R-T4L (PDB accession 3PBL) and H,R-T4L (PDB acces- 
sion 3RZE). Electron density refinement was performed with REFMAC5”, 
autoBUSTER’, and PHENIX” followed by manual examination and rebuilding 
of the refined coordinates in the program COOT™ using both |2F,| — |F.| and 
|F.| — |F,| maps, as well as omit maps. The final model includes 287 residues of A 
chain (Ser 55-Gly 261, Arg 263-Ser 301, Ala 307—Pro 347) and 288 residues of B 
chain (Ser 55-Gly 261, Arg 263-Gly 300, Ser 305-Pro 347) of the k-OR, and resi- 
dues Asn 2-Tyr 161 of both A and B chains of T4L. 

Ligand-binding assay. Membrane preparations, radioligand binding assays 
using *H-diprenorphine and data analyses were performed as previously 
described”. 

Modelling of high-affinity analogues of JDTic and morphine. Docking of high- 
affinity k-OR-specific ligands was performed using an all-atom flexible receptor 
docking algorithm in ICM-Pro (MolSoft LLC) molecular modelling package as 
described previously**. Internal coordinate (torsion) movements were allowed in 
the side chains of the binding pocket, defined as residues within 10 A distance of 
JDTic in the crystal structure. Other side chains and the backbone of the protein 
were kept as in the crystal structure. An initial conformation for each of the ligands 
was generated by Cartesian optimization of the ligand model in Merck Molecular 
Force Field. Docking was performed by placing the ligand in a random position 
within 5 A from the entrance to the binding pocket and global conformational 
energy optimization of the complex*’””. To facilitate side-chain rotamer switches 
in flexible k-OR models, the first 10° steps of the Monte Carlo (MC) procedure 
used ‘soft’ van der Waals potentials and high MC temperature, followed by another 
10° steps with ‘exact’ van der Waals method and gradually decreasing temperature. 
A harmonic ‘distance restraint applied between an amino group of the ligand and 
carboxyl of Asp 138 side chain in the initial 10° steps was removed in the final 10° 
steps. At least 10 independent runs of the docking procedure were performed for 
each k-OR ligand. The docking results were considered ‘consistent’ when at least 
80% of the individual runs resulted in conformations clustered within a r.m.s.d. of 
<1 A to the overall best energy pose of the ligand. All calculations were performed 
on a 12-core Linux workstation. 

Modelling of RB-64. Modelling of RB-64 was performed using SYBYL-X 1.3 
(Tripos) and GOLD 5.1 (Cambridge Crystallographic Data Centre)*. Default 
parameters were used except where noted. The structures of RB-64 and its 
k-OR complexes were energy minimized using the Tripos Force Field 
(Gasteiger—Hiickel charges, distance-dependent dielectric constant ¢ = 4, non- 
bonded interaction cut-off=8A, energy gradient termination = 0.05 kcal/ 
(mol X A)). The «-OR C3157°8 x torsion angle was modified (y; = +60.0°), 
orienting the sulfhydryl group towards the binding cavity. A docking distance 
constraint was used (C315 SG atom to thiocyanate sulphur atom distance, 2.0- 
6.0 A; spring constant = 5.0). The Q115”°°, D138**?, 1290°*, 1294°°°, Y31376 
and 1316””° side chains were allowed to flex via rotamer library. The GoldScore 
fitness function was used with early termination disabled for 30 genetic algorithm 
runs. Poses were selected based on their GoldScore and ability to explain the 
relevant observed biochemical data. Stereochemical quality was assessed using 
PROCHECK. 
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Opium is one of the world’s oldest drugs, and its derivatives morphine and codeine are among the most used clinical 
drugs to relieve severe pain. These prototypical opioids produce analgesia as well as many undesirable side effects 
(sedation, apnoea and dependence) by binding to and activating the G-protein-coupled p-opioid receptor (u-OR) in 
the central nervous system. Here we describe the 2.8 A crystal structure of the mouse p-OR in complex with an 
irreversible morphinan antagonist. Compared to the buried binding pocket observed in most G-protein-coupled 
receptors published so far, the morphinan ligand binds deeply within a large solvent-exposed pocket. Of particular 
interest, the p-OR crystallizes as a two-fold symmetrical dimer through a four-helix bundle motif formed by 
transmembrane segments 5 and 6. These high-resolution insights into opioid receptor structure will enable the 
application of structure-based approaches to develop better drugs for the management of pain and addiction. 


Opium extracts from the plant Papaver somniferum have been used 
for therapeutic and recreational purposes for thousands of years. 
Opioid alkaloids and related pharmaceuticals are the most effective 
analgesics for the treatment of acute and chronic pain. They also 
represent one of the largest components of the illicit drug market 
worldwide, generating revenue of approximately $70 billion in 
2009, much of which supports crime, wars and terrorism (UNODC 
World Drug Report 2011). Intravenous use of opioid drugs is a lead- 
ing cause of death by overdose in Europe and North America, and a 
major contributing factor to the worldwide AIDS epidemic. 

Morphine and codeine are the main active opioid alkaloids in opium. 
In humans, they act on the central nervous system to produce a wide 
range of effects including analgesia, euphoria, sedation, respiratory 
depression and cough suppression, and have peripheral effects such 
as constipation’. Gene disruption studies in mice show that the target 
for the majority of the effects of opioid alkaloids, whether beneficial or 
adverse, is the 1-OR*. The pt-OR belongs to the y subfamily of class A 
G-protein-coupled receptors (GPCRs) with two closely related family 
members known as the 5- and k-opioid receptors*. The 1-OR con- 
stitutes the main opioid target for the management of pain, acute 
pulmonary oedema, cough, diarrhoea and shivering’. However, opioid 
drugs are highly addictive, with the acetylated form of morphine, 
heroin, being the best-known example. Because of this, the clinical 
efficacy of opioid drugs is often limited by the development of tolerance 
and dependence. 

Although both beneficial and adverse effects are attributable to 
activation of the u-OR, they seem to be mediated by different down- 
stream signalling and regulatory pathways. The t}-OR couples pre- 
dominantly to Gi, the inhibitory G protein for adenylyl cyclase. 1)-OR 
signalling through Gi is responsible for its analgesic properties’. After 
activation, the u-OR undergoes phosphorylation and subsequently 
couples to arrestins, which have both regulatory and signalling func- 
tions”. Studies suggest that ligands with the greatest addictive poten- 
tial, such as morphine, promote interactions with Gi more strongly 
than they promote interactions with arrestins®. These studies suggest 


that it may be possible to develop safer and more effective therapeutic 
agents targeting the p-OR. 

To understand better the structural basis for 1.-OR function, we per- 
formed a crystallographic study of this receptor using the T4 lysozyme 
(T4L) fusion protein strategy developed previously’ (Supplementary 
Fig. 1). Using the in meso crystallization method, we obtained crystals 
and collected diffraction data from 25 crystals of Mus musculus 1-OR- 
TAL protein bound to the irreversible morphinan antagonist B-funal- 
trexamine (B-FNA). The structure was solved by molecular replacement 
from a 2.8 A data set. 


Transmembrane architecture 

The lattice for the -OR receptor shows alternating aqueous and 
lipidic layers with receptors arranged in parallel dimers tightly asso- 
ciated through transmembrane (TM) helices 5 and 6. More limited 
parallel interdimeric contacts through TM1, TM2 and helix 8 are 
observed between adjacent dimers (Supplementary Fig. 2). 

As in other GPCRs, the structure of the }1-OR consists of seven TM 
a-helices that are connected by three extracellular loops (ECL1-3) 
and three intracellular loops (ICL1-3) (Fig. 1a). TM3 is connected to 
ECL2 by a conserved disulphide bridge between C140°*° (super- 
scripts indicate Ballesteros—Weinstein numbers*) and C217. The 
morphinan ligand B-FNA (Fig. 1b, c) makes contacts with TM3, 
TM5, TM6 and TM7 (Fig. 1a), and the electron density observed in 
the structure confirms previous data identifying the K233°°? side 
chain as the site of covalent attachment” (Fig. 1c and Supplemen- 
tary Fig. 3). 

The intracellular face of the j1-OR closely resembles rhodopsin with 
respect to the relative positions of TM3, TM5 and TM6 (Supplemen- 
tary Fig. 4). Nevertheless, like the B.-adrenergic receptor (B2-AR), 
there is no ionic bridge between the DRY sequence in TM3 and the 
cytoplasmic end of TM6. As with the B,-AR, R165°°° forms a salt 
bridge with the adjacent D164°” of the DRY sequence. D164°* also 
engages in a polar interaction with R179 in ICL2, a feature that is 
similar to an interaction observed between D130°*? and $143 in ICL2 
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Figure 1 | Overall view of the n-OR structure. a, Views from within the 
membrane plane (left), extracellular side (top) and intracellular side (bottom) 
show the typical seven-pass transmembrane GPCR architecture of the ,t-OR. 
The ligand, B-FNA, is shown in green spheres. b, The chemical structure of 


of the B,-AR (Supplementary Fig. 4). In the -OR, it has been shown 
that the mutation of T279°** to a lysine results in a constitutively 
active receptor’’. This may be explained by a polar interaction 
observed in the crystal structure of the u-OR between T279°** and 
R165°°° (Supplementary Fig. 4). This interaction may stabilize the 
receptor in an inactive state. 


90%) | 90%) | 


u-OR M3 receptor 


Figure 2 | Comparison of ligand-binding pockets. a, b, The binding pocket 
of the L!-OR (a) is wide and open above the ligand, in stark contrast to the deeply 
buried binding pocket of the muscarinic receptors, as exemplified by the M3 

receptor (b). c, Top, the small-molecule antagonist IT1t (magenta) occupies a 
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‘i Morphine 


an 


TM7 


Helix 8 


TM4 


morphine. c, The chemical structure of B-FNA and the chemical reaction with 
the side chain of K233°*? in the receptor are shown. B-ENA is a semisynthetic 
opioid antagonist derived from morphine, shown in b. 


An exposed ligand-binding pocket 

In most available GPCR structures, the ligand is partially buried within 
the helical bundle by more superficial residues in TM segments and 
ECL2. The most extreme examples are the M2 and M3 muscarinic 
receptors’, in which the ligand is covered with a layer of tyrosines 
(Fig. 2). This provides a structural basis for the very slow dissociation 


binding pocket closer to the extracellular surface of CXCR4 than B-FNA in 
u-OR. Bottom, B-FNA is positioned more similarly to the distantly related 
aminergic receptors for the binding site of carazolol (yellow) in the B.-AR. 


©2012 Macmillan Publishers Limited. All rights reserved 


kinetics of muscarinic antagonists. For example, the dissociation half- 
life of the clinically used drug tiotropium at the M3 receptor is 34.7h 
and its dissociation constant (Kg) is 40 pM (ref. 13). By contrast, the 
binding pocket for B-FNA in the l-OR is largely exposed to the extra- 
cellular surface (Fig. 2a). This may explain why extremely potent 
opioids such as buprenorphine, with an inhibition constant (K;) of 
740 pM, diprenorphine (K; 72 pM), alvimopan (K; 350pM) and 
etorphine (K; 230pM) present rapid dissociation half-lives of 
44 min, 36 min, 30 min“ and less than 1 min (ref. 15), respectively. 
Therefore, although the affinity of high-affinity opioid ligands is com- 
parable to tiotropium, the dissociation kinetics are considerably dif- 
ferent. This feature of opioid ligands may explain why heroin overdoses 
are rapidly reversible by naloxone’®. In addition, the extremely high 
potency and fast kinetics of etorphine agonism and diprenorphine 
antagonism allows for a system that is capable of rapid anaesthesia 
and prompt reversal in veterinary use. As a result, etorphine is a pre- 
ferred anaesthetic (dose in the range of 5-20pgkg *) for valuable 
racehorses and for captive and free-ranging mammals”. 

The t-OR belongs to a subgroup of peptide GPCRs, and the closest 
published structure is that of the CKCR4 chemokine receptor’® (root 
mean squared deviation (r.m.s.d.) value of 1.35 A). In the u-OR the 
morphinan ligand B-FNA binds much more deeply than the small- 
molecule CXCR4 antagonist IT1t and occupies a similar position as 
agonists and antagonists for the §.-AR (r.m.s.d. value of 1.52 A) and 
other monoamine receptors (Fig. 2c). 


Figure 3 | Structural basis for morphinan ligand binding to the p-OR. 

a, Side view of the ligand-binding pocket with polar interactions shown. TM6 is 
excluded from this view. The electron density used to position interacting side 
chains is shown in light blue coloured mesh depicting the 2F, — F, electron 
density contoured at 1.30. Green mesh depicts an omit map of B-FNA and 
K233°*? side-chain atoms contoured at 3.0c. b, Binding pocket viewed from 
the extracellular surface. Water molecules are shown as red spheres, with the 
accompanying electron density shown in light blue mesh. c, The binding site is 
diagrammed, showing the chemical structure of B-FNA (green) covalently 
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Binding pocket and opioid specificity 

There are 14 residues within 4 A of B-FNA. Nine of these have more 
direct interactions with the ligand (Fig. 3a—c), and are conserved in the 
k-OR and 5-OR. D147°*? engages in a charge-charge interaction with 
the amine moiety of the ligand and hydrogen bonds with Y326’*? (both 
residues are strictly conserved in all the opioid receptor subtypes). 
Although D147°*? occupies the same position as the counterion in 
aminergic receptors, a sequence comparison shows that it is not con- 
served in other peptide receptors. H297°”? interacts with the aromatic 
ring of the morphinan group, but does not directly hydrogen bond with 
B-FNA as has been previously suggested’*. However, the electron 
density suggests the presence of two water molecules that are well 
positioned to form a hydrogen-bonding network between H297°°? 
and the phenolic hydroxyl of the morphinan group (Fig. 3b, c). 

A direct comparison with the 6-OR sequence also shows that of the 
14 residues within 4 A of the ligand, 11 are identical between 1-OR 
and 8-OR. The three differences are at t-OR positions E229°°'”, 
K303°°* and W318’°°, which are Asp, Trp and Leu in the 8-OR, 
respectively. The substitution of leucine in 5-OR for W318’*? is high- 
lighted in Fig. 3d. W318’*° was shown to be responsible for the 
binding selectivity of naltrindole, a 6-OR-selective antagonist and of 
[D-Pen2,p-Pen5]enkephalin (DPDPE), a 65-OR-selective peptide 
agonist’. In particular, the point mutation W318L markedly 
increases the affinity of both these ligands at the 11-OR. Positioning 
naltrindole (represented in Fig. 3d) into the 1-OR-binding pocket by 
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bound to the receptor through K233°~” (bold). Hydrophobic interactions are 
shown in orange and polar contacts with red dotted lines. V300°°° and 1296°°" 
form extensive hydrophobic contacts with the back face of the ligand (not 
shown). Two water molecules are positioned between H297°°? and the 
phenolic group of B-FNA. d, The 5-OR-selective ligand naltrindole includes an 
indole group that would clash with W318”*° in 1-OR, but not with the leucine 
found in the equivalent position in 6-OR. The indole has been described as an 
‘address’ to target the ligand to 6-OR, whereas its efficacy (‘message’) is 
determined by the morphinan group on the left’’. 
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superimposition of its morphinan group on that of B-FNA shows that 
naltrindole would clash with the W318 side chain in !-OR (Fig. 3d), 
whereas the leucine in this position of 6-OR would probably accom- 
modate naltrindole without requiring structural rearrangement. 
Endomorphins 1 and 2 are small peptides isolated from brain that 
were shown to have the highest affinity (low nM range) and the highest 
selectivity profile for the t-OR receptor”'. For instance, endomorphin 1 
exhibits 4,000- and 15,000-fold selectivity for L-OR over 6-OR and 
«-OR, respectively”. Although little is known about the determinants 
of endomorphin binding, mutagenesis studies suggest that the ,1- 
OR-selective synthetic peptide agonist [D-Ala2,N-MePhe4,Gly-ol5] 
enkephalin (DAMGO) occupies a space that overlaps with the 
B-FNA-binding pocket but also extends beyond this site’. Sites of 
mutations that impair DAMGO binding include H297°*”, positioned 
near the bottom of the B-FNA pocket, as well as K303°°°, W318" and 
H319’*°, positioned above the B-FNA-binding pocket (Supplemen- 
tary Fig. 5). Given the residues involved in DAMGO binding to 1!-OR, 
opioid peptides probably make both polar and non-polar contacts 
within the j1-OR-binding pocket. This feature of opioid peptide bind- 
ing is also reflected in the lack of a highly charged surface within the 
ut-OR-binding pocket compared with that of the CXCR4 receptor’®. 


Oligomeric arrangement of p-OR 


The structure of |t-OR shows receptor molecules intimately associated 
into pairs along the crystallographic two-fold axis through two dif- 
ferent interfaces (Fig. 4a, b). The first interface is a more limited 
parallel association mediated by TM1, TM2 and helix 8, with a buried 
surface area of 615 A? (Fig. 4d and Supplementary Fig. 6). The second 
and more prominent interface observed in the -OR crystal structure 
is comprised of TM5 and TM6 (Fig. 4c). In this case, within each 
H-OR-L-OR pair, the buried surface area for a single protomer is 
1,492 A?. This represents 92% of the total buried surface between 
H-OR-T4L molecules, indicating that the comparatively small 
114A” buried surface contributed by T4L is unlikely to drive the 
contact (Supplementary Fig. 7). This suggests that the pairwise asso- 
ciation of receptor monomers may represent a physiological opioid 
receptor dimer or higher-order oligomer, the existence of which is 
supported by previous biochemical, pharmacological and cell bio- 
logical studies”. 

Recent computational and biochemical studies have indicated the 
potential role of TM4 and TMS in the interaction between 6-OR 
receptors. More generally, oligomers have been observed for a large 
number of GPCRs (recently reviewed in ref. 25). Some of these studies 
have shown that TM5 and TM6 peptides can disrupt dimers of the 
B.-AR and V2 vasopressin receptor**’’, and recent crosslinking 
experiments with the M3 muscarinic receptor suggest a direct dimeric 
contact mediated by TMS5 of each monomer”. The potential involve- 
ment of the alternative TM1-TM2-H8 (where H8 is helix 8) interface 
in GPCR oligomerization has previously been indicated by several 
different biochemical studies” and, more recently, by the structure 
of opsin (Protein Data Bank (PDB) accession 3CAP)”. In the case of 
opioid receptors, it has been shown that a u-OR TM1 domain fused to 
a polybasic TAT sequence could disrupt the t-OR-6-OR interaction 
in the mouse spinal cord, resulting in an enhancement of morphine 
analgesia and a reduction in morphine tolerance”. 

The more prominent interface observed in the }1-OR crystal struc- 
ture is comprised of TM5 and TM6 of each protomer arranged in a 
four-helix bundle motif (Fig. 5a). This interface is formed by an 
extensive network of interactions involving 28 residues in TM5 and 
TM6 (Fig. 5c and Supplementary Fig. 8). These surface packing inter- 
actions are highly complementary and are maintained all along the 
receptor membrane plane from the extracellular to the intracellular 
side of the j-OR (Fig. 5c, d). The T279°** residue described earlier as 
having a role in maintaining the receptor in an inactive state is also 
part of the dimer interface, with the methyl of the threonine contact- 
ing 1256” of the adjacent protomer. It is thus tempting to speculate 
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Figure 4 | p-OR oligomeric arrangement. a, b, [!-OR crystallized as 
intimately associated pairs, with two different interfaces as defined in the text. 
c, d, The interface defined by TM5 and TM6 (c) is much more extensive than 
for the one defined by TM1-TM2-H8 (d). 


that dimerization of the 1-OR could have a role in regulating receptor 
signalling. 

The observed dimer is of interest because of existing evidence for 
both homo- and heterodimers (or oligomers) involving the t-OR”’. It 
has been suggested that opioid agonists such as DAMGO and 
methadone reduce tolerance to morphine in vivo by facilitating 
morphine-induced endocytosis through p1-OR oligomerization*””’. 
These studies implicate allosteric interactions between a protomer 
bound to DAMGO or methadone and an adjacent protomer bound 
to morphine. Co-expressing [1-OR and 6-OR in cells results in phar- 
macological profiles distinct from either receptor expressed alone™. 
Of interest, morphine is more efficacious in cells expressing both 
u-OR and 6-OR in the presence of a 6-OR-selective antagonist, sug- 
gesting an allosteric interaction between t-OR and 6-OR protomers”’. 
Hetero-oligomerization between 1-OR and non-opioid receptors has 
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Figure 5 | The four-helix bundle interface. a, Schematic showing the four- 
helix bundle architecture of the TM5-TM6 interface. b, Viewed from the 
extracellular surface, the binding pocket shows tight association between the 
ligand (green sticks) and residues that are involved directly or indirectly in 
forming the dimeric interface (blue spheres). c, The four-helix bundle is 
expanded and shown in detail with interacting residues within 4.2 A shown as 
sticks. d, Tomographic representation along the dimer interface viewed from 
the extracellular side (as indicated in panel c) showing the high surface 
complementarity within the four-helix bundle interface. 


also been reported**. For example, the a2, adrenergic receptor was 
shown to modulate receptor |1-OR structure and signalling”®. 

Consistent with a role for oligomerization in 1-OR function, we 
observed that the amino acids involved in the dimer interface display a 
high degree of homology with the 6-OR (Supplementary Figs 9 and 
10). Replacing the residues of 1-OR with the corresponding residues 
from 6-OR would not be predicted to interfere with dimer formation 
(Supplementary Figs 9 and 10). This analysis also suggests that a 
u-OR-6-OR dimer could share the same interface. Interestingly, in 
the ut-OR TM5-TM6 dimer, the two binding sites are coupled 
through a network of packing interactions at the dimeric interface 
(Fig. 5b). This network could provide a structural explanation for the 
distinct pharmacological profiles obtained for t-OR heterodimers 
and for the allosteric effects of one protomer on the pharmacological 
properties of the other. This dimeric interface thus provides potential 
insights into the mechanism of allosteric regulation of one GPCR 
protomer by the other. 

Parallel dimers have also been observed in other GPCR crystal 
structures, most notably in CXCR4-T4L'*. Interestingly, the 
CXCR4 dimer is also related by a two-fold rotational symmetry axis 
with a receptor arrangement similar but not identical to that seen in 
ut-OR (Supplementary Fig. 8). However, for the five different CKCR4- 
TAL crystal structures, the largest calculated contact area between the 
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two CXCR4 protomers is smaller (1,077 A? for PDB accession 30E0) 
than in the p1-OR structure (Supplementary Fig. 7), and it presents a 
comparatively less extensive network of interactions (Supplementary 
Fig. 8). 

The dimeric arrangement of |1-OR across the TM5-TM6 interface 
observed in the crystal structure would probably preclude either 
protomer from coupling to G proteins. This is based on structural 
changes in TM5 and TM6 observed in the recent crystal structure of 
the B.-AR-G, complex”. This is also consistent with the observation 
that inverse agonists stabilize B,-AR oligomers, while the G protein G, 
reduced the extent of oligomerization*®. However, we were able to 
model an active structure of u-OR in complex with G protein based 
on the crystal structure of the B,-AR-G, complex. Here, we observed 
that a tetramer formed by the association of two dimers through a 
TM5-TM6 interface would accommodate two G proteins in inter- 
action with the two distal protomers (Supplementary Fig. 11). This 
model of an activated t-OR-G-protein oligomeric complex is highly 
speculative but is compatible with results from a recent biophysical 
study suggesting that the G-protein G, remains associated with a 
ut-OR tetramer stabilized by the agonist morphine”. 

The yt-OR is perhaps the most economically important GPCR in 
terms of the combined legal and illicit drug market. Although there are 
a number of effective drugs targeting the u-OR on the market, the ideal 
agonist has yet to be developed. The structure of the ,1-OR presented 
here provides the first high-resolution insight, to our knowledge, into a 
peptide receptor that can also be activated by small-molecule agonist 
ligands, some of which are the oldest used drugs in human history. This 
structure will enable the application of structure-based approaches to 
complement more conventional drug discovery programs. In addition, 
it may provide novel insights into the role of oligomerization in GPCR 
function. 


METHODS SUMMARY 


The t-OR-T4L fusion protein was expressed in Sf9 insect cells and 
purified by nickel affinity chromatography followed by Flag antibody 
affinity chromatography and size-exclusion chromatography. It was 
crystallized using the lipidic cubic phase technique, and diffraction data 
were collected at GM/CA-CAT beamline 23ID-D at the Advanced 
Photon Source at Argonne National Laboratory. The structure was 
solved by molecular replacement using merged data from 25 crystals. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
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METHODS 


Expression and purification. Previously crystallized GPCRs show little density 
for the poorly ordered amino- and carboxy-terminal domains. Although these 
domains are not critical for maintaining high ligand affinity, these flexible regions 
may inhibit crystallogenesis’. We therefore removed these regions in the receptor 
construct used for crystallography. Specifically, a TEV protease recognition site 
was introduced after reside G51 in the amino terminus and the C terminus was 
truncated after Q360. The short third intracellular loop of 1-OR, consisting of 
residues 264-269, was replaced with T4L residues 2-161 in a manner described 
previously’. To facilitate receptor purification, a Flag M1 tag was added to the N 
terminus and an octa-histidine tag was appended to the C terminus. Finally, a 
proline residue was introduced N-terminal to the octahistidine tag to allow effi- 
cient removal of C-terminal histidines by carboxypeptidase A. For these studies, 
we used the M. musculus \-OR sequence because it is expressed at higher levels. 
The mouse and human p1-OR share 94% sequence identity and there are only four 
residues in the resolved part of the structure that differ between the mouse and 
human pt-OR. These include residues 66, 137, 187 and 306, which are all in the 
extracellular or intracellular loops of t-OR and do not make contacts in the 
ligand-binding pocket. The final crystallization construct ({1-OR-T4L) is shown 
in a representative snake diagram in Supplementary Fig. la. 

We compared the pharmacological properties of -OR-T4L to those of the 
wild-type receptor (Supplementary Fig. 1b). Both constructs showed identical 
affinity for the radiolabelled antagonist [*H]-diprenorphine ({*H]DPN). 

The !-OR-T4L construct was expressed in Sf9 cells using the baculovirus 
system. Culture media was supplemented with 10 UM naloxone to stabilize the 
receptor during expression. Cells were infected at a density of 4 X 10° cells per ml 
and culture flasks were shaken at 27 °C for 48 h. After harvesting, cells were lysed 
by osmotic shock in a buffer comprised of 10 mM Tris-HCl pH 7.5, 1 mM EDTA, 
100 1M TCEP, 1 1M naloxone and 2mgml ' iodoacetamide to block reactive 
cysteines. Extraction of t-OR-T4L from Sf9 membranes was done with a Dounce 
homogenizer in a solubilization buffer comprised of 0.5% dodecyl maltoside 
(DDM), 0.3% 3-[(3-Cholamidopropyl) dimethylammonio]-1-propanesulphonate 
(CHAPS), 0.03% cholesterol hemisuccinate (CHS), 20 mM HEPES pH 7.5, 0.5M 
NaCl, 30% v/v glycerol, 2mg ml! iodoacetamide, 100 uM TCEP and 1uM 
naloxone. After centrifugation, nickel-NTA agarose was added to the supernatant, 
stirred for 2h, and then washed in batch with 100g spins for 5 min each with a 
washing buffer of 0.1% DDM, 0.03% CHAPS, 0.01% CHS, 20 mM HEPES pH 7.5 
and 0.5 M NaCl. The resin was poured into a glass column and bound receptor was 
eluted in washing buffer supplemented with 300 mM imidazole. 

We used anti-Flag M1 affinity resin to purify 1-OR-T4L further and to 
exchange the ligand with the covalent antagonist B-FNA. Nickel-resin eluate 
was loaded onto anti-Flag M1 resin and washed extensively in the presence of 
10 1M B-FNA. The detergent DDM was then gradually exchanged over 1h into a 
buffer with 0.01% lauryl maltose neopentyl glycol (MNG) and the NaCl concen- 
tration was lowered to 100mM. Receptor was eluted from the anti-Flag M1 
affinity resin with 0.2 mg ml ' Flag peptide and 5mM EDTA in the presence 
of 14M B-FNA. To remove the N terminus of {t-OR-T4L, TEV protease was 
added at 1:3 w/w (TEV:u-OR-T4L) and incubated at room temperature (23 °C) 
for 1h. Receptor was then treated with carboxypeptidase A (1:100 w/w) and 
incubated overnight at 4°C to remove the octa-histidine tag. The final purifica- 
tion step separated TEV and carboxypeptidase A from receptor by size exclusion 
chromatography on a Sephadex S200 column (GE Healthcare) in a buffer of 
0.01% MNG, 0.001% CHS, 100mM NaCl, 20mM HEPES pH 7.5 and 1M 
B-FNA. After size exclusion, B-FNA was added to a final concentration of 10 1M. 
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The resulting receptor preparation was pure and monodisperse (Supplementary 
Fig. 12). 

Crystallization and data collection. Purified 1.-OR-T4L receptor was concen- 
trated to 30mgml’ using a Vivaspin sample concentrator with a 50kDa 
molecular weight cut-off (GE Healthcare) and crystallization was performed 
using the in meso method*!. Concentrated 1-OR-T4L was reconstituted into 
10:1 monoolein:cholesterol (Sigma) in a ratio of 1:1.5 parts by weight receptor:lipid 
mixture. Reconstitution was done by the two-syringe method”. The resulting 
mesophase was dispensed onto glass plates in 80-nl drops and overlaid with 
700 nl precipitant solution by a Gryphon LCP robot (Art Robbins Instruments). 
Crystals grew in precipitant solution consisting of 30-38% PEG 400, 100mM 
HEPES pH 7.0, 7.5% DMSO and 300 mM lithium sulphate. Crystals were observed 
after 24h and grew to full size after 5 days. Typical crystals before harvesting are 
shown in Supplementary Fig. 2. 

Diffraction data were collected at Advanced Photon Source GM/CA-CAT 

beamline 23ID-D using a beam size of 10 um. Owing to radiation damage, the 
diffraction quality decayed during exposure. Wedges of 10-20 degrees were col- 
lected and merged from 25 crystals using HKL2000”. Diffraction quality ranged 
from 2.4-3.5 A in most cases. The structure of the {t-OR was solved by molecular 
replacement in Phaser* using the CXCR4 receptor as a search model. We 
improved the initial model by iteratively building regions of the receptor in 
Coot™ and refining in Phenix*. We used translation libration screw-motion 
(TLS) refinement with groups generated within Phenix. Electron density sug- 
gested the presence of a cholesterol molecule and a monoolein lipid within the 
lipidic layer. These were subsequently incorporated into the model. To assess the 
overall quality of the final structure, we used MolProbity”*. The resulting statistics 
for data collection and refinement are shown in Supplementary Table 1. Figures 
were prepared in PyMOL”. 
Saturation binding experiments. Membrane homogenates were prepared 
from Sf9 cells expressing either wild-type [1-OR or 1-OR-T4L. Membranes con- 
taining 1-OR or t-OR-TAL were incubated with the opioid antagonist, [7H] DPN 
for 1h at 22 °C in 0.5 ml of binding buffer containing 75 mM Tris-HCl pH 7.4, 
1mM EDTA, 5mM MgCl, 100mM NaCl. To determine the affinity for 
diprenorphine, we used [*H]DPN concentrations ranging from 0.1 to 13.5 nM. 
High concentrations of un-labelled naloxone (1 [.M) were used to determine non- 
specific binding. To separate unbound [°H]-ligand, binding reactions were 
rapidly filtered over GF/C Brandel filters. The filters were then washed three 
times with 5 ml ice-cold binding buffer. Radioactivity was assayed by liquid 
scintillation counting. The resulting data were analysed using Prism 5.0 
(GraphPad Software). (H]DPN (specific activity: 55.0 Ci mmol ') was obtained 
from PerkinElmer Life Sciences. 
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QUANTUM OPTICS 


An entangled walk of photons 


By harnessing the quantum nature of light and guiding the light through a network of circuits integrated in a glass chip, 
it is possible to mimic fundamental particles undergoing a quantum walk. 


JONATHAN C. F. MATTHEWS 
& MARK G. THOMPSON 


uantum mechanics is the most success- 

ful model of nature that we have, 

accurately describing fundamental 
physical processes. Although quantum effects 
such as entanglement and superposition 
are counterintuitive and often described as 
‘spooky, they are being observed and charac- 
terized in laboratories worldwide. Writing in 
Physical Review Letters, Sansoni et al.' describe 
how they have designed and built an optical 
network, integrated in a glass chip, that manip- 
ulates photons to simulate a process known as 
a discrete-time quantum walk. Furthermore, 
using a particular kind of entanglement, the 
authors simulated different classes of funda- 
mental particles undergoing the quantum- 
walk process’. The results are a step towards 
the development of quantum-mechanical 
machines that promise to outperform conven- 
tional supercomputers, which operate accord- 
ing to the laws of classical physics. 

Perhaps the most far-reaching applications 
of quantum technologies will be based on the 
American physicist Richard Feynman's pro- 
posal that an efficient way to simulate one 
quantum system is to use another. Such simu- 
lators could be used to study complex quantum 
systems that are computationally hard to simu- 
late or difficult to physically control. We are 
currently witnessing initial demonstrations of 
these simulators in the form of purpose-built 
quantum devices that mimic other, less readily 
accessible quantum systems. 

Developments in quantum simulation are 
still at the stage of mimicking systems simple 
enough to be handled with classical comput- 
ers. However, photonics is an attractive candi- 
date for reaching the point at which quantum 
machines could outperform state-of-the-art 
supercomputers for particular tasks. This is 
partly thanks to the complexity and stability 
of quantum networks realized with integrated 
optics, and to the nature of multiple identical 
photons interfering in a sufficiently compli- 
cated optical network’. 

In classical physics, the analogue of Sansoni 
and colleagues’ discrete-time quantum walk 
would be a form of random walk, a tool 
already used in a broad range of fields from 


Time 


Figure 1 | Classical and quantum walks. a, Ina 
classical random walk, a walker must make a choice 
(randomly) of moving either left or right at each 
step. After many trials of a fixed number of steps, 
the walker is most probably found close to its initial 
starting point. b, In a quantum walk, the walker 
uses a ‘quantum coin mechanism that allows it to 
move ina superposition of both left and right. The 
probability distribution of its position after many 
steps is therefore starkly different from that ofa 
classical random walk, with the walker most likely 
to be found far from its initial starting point. 


animal behaviour to economics, for example 
in modelling the path ofa foraging animal or 
fluctuating prices in the stock market. The 
simplest random walk is on a line, and is con- 
ducted by repeatedly flipping a coin and walk- 
ing left if the coin lands on heads, and right 
if it lands on tails — on average, the walker 
remains close to where it started. By contrast, 
if the walker is endowed with quantum prop- 
erties (for example, in the case ofa photon), it 
moves in a superposition of both left and right 
at each coin flip, with these multiple possibili- 
ties undergoing interference and giving rise to 
completely different behaviour from that of a 
classical random walk (Fig. 1). 

Such quantum walks were initially conceived 
as an abstract computer-science model for 
developing ideas in quantum computation. 
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But now researchers are interested in using 
quantum walks to mimic real systems that 
exhibit local coupling. Examples of such sys- 
tems are materials in which particles with spin 
are coupled or excitons travelling around large 
molecules (an exciton is a particle-like entity 
formed by an electron and the ‘hole’ left by a 
missing electron). 

So far, all demonstrations of quantum walks 
(including those based on cold atoms, nuclear 
magnetic resonance and photonics) have side- 
stepped the requirement of a quantum com- 
puter by constructing purpose-built systems 
that exhibit quantum-walk dynamics. These 
demonstrations have allowed tests of quantum 
phenomena to be performed ahead of building 
a full-scale quantum computer. The first walks 
to be realized were of single quantum walkers, 
which display classical wave-like behaviour’. 
Multiple quantum walkers lead to behaviour 
that cannot be explained with wave phenom- 
ena alone, and have been reported” using two 
indistinguishable photons launched into inte- 
grated optical circuits. Sansoni and colleagues’ 
demonstration’ goes further by entangling the 
walkers in a discrete-time quantum-walk net- 
work’ achieved using an integrated network of 
waveguide beam splitters. 

The particles of choice for Sansoni et al. are 
photons, which show great promise for quan- 
tum technologies’. They are readily manipu- 
lated at the single-photon level and experience 
negligible noise — in effect, they ‘leak very 
little quantum information to the environ- 
ment. Working with photons also allows the 
deployment of integrated quantum photonics® 
to make reconfigurable, complex miniature 
circuits that are inherently stable within frac- 
tions of an optical wavelength (about 10 nano- 
metres)’. The authors fabricated their circuits 
using a laser ‘direct-write’ technique that 
allows three-dimensional optical circuits to be 
written into a solid block of borosilicate glass 
using a tightly focused beam of laser pulses”. 

Sansoni and colleagues’ optical network’ 
consisted of ten waveguide beam splitters at 
discrete points in a centimetre-scale glass cir- 
cuit. Each beam splitter provided the ‘quantum 
coin’ mechanism that repeatedly placed the 
photons in quantum superposition as they pro- 
gressed through the network (Fig. 2). More- 
over, thanks to the type of entanglement they 
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Figure 2 | Discrete quantum walk ona photonic chip. Sansoni et al.’ fabricated a network of ten 
waveguide beam splitters in a borosilicate-glass chip using a focused pulsed laser (not shown) and 
launched two entangled photons (spheres) into the network. The photons underwent a quantum walk 
and were detected at the chip’s output. The output depends on the symmetry of the entanglement between 
the photons. A symmetric entangled input leads to boson-like behaviour (the photons can occupy the 
same waveguide and so can exit the waveguides together), whereas an antisymmetrically entangled input 
yields fermion-like behaviour (the photons cannot occupy the same waveguide and therefore exit through 
separate waveguides). The size of the spheres at the output illustrates the probability of the photons 
leaving each of the waveguides, with a bigger size corresponding to a larger probability. 


used and the fact that their device preserves 
photon polarization, the authors were able to 
simulate quantum walks not just for photons 
(and therefore the boson class of particles) but 
also fundamentally different classes of parti- 
cles (the fermion and intermediate-type fami- 
lies). This approach can be generalized for any 
quantum network (not just quantum walks) 
and can be scaled-up to simulate any number 
of particles’. 

The scalability of integrated-optics circuits 
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allows for incredible flexibility: waveguide 
networks, consisting of hundreds of wave- 
guides and beam splitters, could be imple- 
mented in future experiments to simulate more 
complex dynamics than those demonstrated 
by the authors. So far, only two non-interact- 
ing quantum particles have been simulated 
undergoing relatively simple quantum walks. 
Large numbers of photons will be required for 
any practical quantum-photonic technology, 
and future developments could include the 
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simulation of interacting quantum particles 
(such as charged particles) through implemen- 
tation of photon-photon nonlinearities’ using 
auxiliary photon measurement or light-matter 
interaction, for example. 

However, a recent, and perhaps surprising, 
theoretical result’ is that a large number of 
identical photons themselves pose computa- 
tionally difficult problems due to their bosonic 
nature, even in the absence of interaction. 
Using just tens of photons guided in a circuit 
with hundreds of waveguides could lead to the 
first quantum computation competitive with 
the equivalent calculation performed with the 
best classical computers currently available. 
Sansoni and colleagues’ study is a development 
in this direction, and paves the way for using 
integrated quantum circuits to build a practical 
quantum simulator. = 
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